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PREFACE 


This book has been written on the basis of lectures which I delivered 
at the department of mathematics and mechanics of Moscow State Uni- 
versity. In drawing up the program for my lectures, I proceeded on the 
belief that the selection of material must not be random nor must it rest 
exclusively on established tradition. The most important and interesting 
applications of ordinary differential equations to engineering are found 
in the theory of oscillations and in the theory of automatic control. These 
applications were chosen to serve as guides in the selection of material. 
Since oscillation theory and automatic control theory without doubt also 
play a very important role in the development of our contemporary tech- 
nical culture, my approach to the selection of material for the lecture 
course is, if not the only possible one, in any case a reasonable one. In 
attempting to give the students not only a purely mathematical tool 
suitable for engineering applications, but also to demonstrate the appli- 
cations themselves, I included certain engineering problems in the lectures. 
In the book they are presented in §13, 27, and 29. I consider that these 
problems constitute an integral organic part of the lecture course and, 
accordingly, of this book. 

In addition to the material presented in the lectures, I have included in 
the book more difficult problems which were investigated in student 
seminars. They are contained in §19 and 31. The material contained 
in §24, 25, and 30 was only partially presented in the lectures. For the 
convenience of the reader, in the last chapter, the sixth, are presented 
certain facts from linear algebra in the form in which they are used in 
this book. 

In closing, I wish to express my gratitude to my students and to my 
closest co-workers V. G. Boltyanskiy, R. V. Gamkrelidze, and E. F. 
Mishchenko, who helped me in the preparation and delivery of the lectures 
and in writing and editing this book. I want also to note the decisive in- 
fluence upon my scientific interests exerted by the outstanding Soviet 
specialist in the field of oscillation theory and automatic control theory, 
Aleksandr Aleksandrovich Andronoy, with whom for many years I have 
had a friendly relationship. His influence has substantially affected the 
character and direction of this book. 


Moscow L. 8. Pontryagin 
16 July 1960 
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CHAPTER 1 
INTRODUCTION 


This chapter is devoted primarily to the definition of those concepts 
which will be studied subsequently. What is a system of ordinary differ- 
ential equations, what do we mean by a solution of it, and how many of 
these solutions exist? These are the basic questions which we shall attempt 
to answer in this chapter. The number of solutions is determined by 
theorems of existence and uniqueness, which will not be proved here, but 
only formulated. The proofs of these and of a number of other theorems of 
the same type are given in the fourth chapter, but theorems previously 
formulated in the first chapter are repeatedly used, so that their meaning is 
thus clarified. In addition to such basic information, solutions of differen- 
tial equations of several of the simplest types are given in the first chapter. 
At the end of the chapter complex differential equations and their com- 
plex solutions are studied, and elementary facts concerning systems of 
linear differential equations are given. 


1. First-order differential equations. Equations in which the unknowns 
are functions of one or several variables and which contain not only the 
functions themselves, but also their derivatives, are called differential 
equations. If the unknown functions are functions of several variables, 
then the equations are called partial differential equations; in the opposite 
case, 1.e., for the case of functions of only one independent variable, the 
equations are called ordinary differential equations. In this book we shall 
deal only with the latter. 

In applications to physics the time is taken as the independent variable, 
which is conventionally designated by the letter t; throughout this book 
the independent variable will be designated by ¢t. Unknown functions will 
be designated by z, y, z,andso on. Derivatives of functions with respect to 
t will as a rule be designated by dots: = dz/dt, £ = d?x/dt?, and so on. 
When this is inconvenient or impossible, we shall denote the order of a 
derivative by an upper index in parentheses; for example, 2°” = d"x/dt”. 

First we shall study the first-order differential equation. This equation 
may be written in the form 


F(t, z, 2) = 0. (1) 


Here ¢ is the independent variable, « the unknown function, < = dz/dt the 

derivative, and F a given function of three variables. The function F 

need not be defined for all values of its arguments; therefore we speak of 
1 
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the domain of definition B of the function F or simply, the domain B of F; 
here we have in mind a domain in the space of the three variables z, y, z. 
Equation (1) is called a first-order equation because it contains only the 
first derivative < of the unknown function x. A function xc = ¢(t) of the 
independent variable it, defined on a certain interval r; < t < rg (the 
cases fT; = —%,fr2 = +o are not excluded), which, when substituted 
for x in equation (1), reduces (1) to an identity on the entire interval ry < 
t < ro, is called a solution of equation (1). The interval ry < t < ro is 
called the interval of definition of the solution g(t). It is evident that sub- 
stitution of x = ¢(t) in (1) is possible only when the function g(t) has a 
first derivative (and, in particular, is continuous) on the entire interval 
ry <€ < re. Yor the substitution of x = ¢(t) into equation (1) to be 
possible, it is also necessary that the point with coordinates (t, ¢(é), ¢(t)) 
belong to the domain B of the function F for any value of ¢ in the interval 
Ty <b <7. 

Relation (1) connects the three variables t, x, ~. In certain cases it deter- 
mines < as a single-valued, implicit function of the independent variables 
t, x. In this case (1) is equivalent to a differential equation of the form 


z = f(t, 2). (2) 


Equation (2) is said to be solved explicitly for the derivative; in certain re- 
spects it is more amenable to study than the general differential equa- 
tion (1). It is such explicit equations which we shall now study. We 
shall not assume that (2) has been obtained as a result of solving (1) for 2, 
but shall proceed from the function f(t, x) as a given function of the two 
independent variables 1, x. 

In order to visualize the situation geometrically, we introduce for study 
the iz-plane P. We shall plot t, as an independent variable, along the axis 
of abscissas, and x, as a dependent variable, along the axis of ordinates. 
The function f appearing in (2) need not be defined for all values of t 
and z, or, in geometric language, need not be defined at all points of the 
plane P, but only at points of a certain set T of P (Fig. 1). We shall assume 
that the set Tis a domain. This means that for every point p in I there is 
some circle of positive radius with center at p also contained in T. Con- 
cerning the function f, it will be assumed that both the function itself and 
its partial derivative, df/dx, are continuous functions of and rinT. A 
solution x = ¢() of equation (2) may be thought of geometrically in P 
as a curve with the equation x = ¢(t). This curve has a tangent at every 
point and lies entirely in the domain I; it is called an integral curve of the 
differential equation (2). 

Existence and uniqueness theorem. In algebra it is known that a large 
role is played by theorems which give the number of solutions to a given 
equation or system of equations. One such example is the fundamental 
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Figure 1 


theorem of algebra, which asserts that a polynomial of the nth degree always 
has exactly n roots (counted according to multiplicity). In exactly the 
same way, in the theory of differential equations the important theoretical 
problem is how many solutions the differential equation has. It turns out 
that every differential equation has a continuum of solutions and this is 
why the question to be posed does not concern the number of solutions, 
but rather how the set of all solutions of a given differential equation 
can be described. The answer to this question is given by the existence 
and uniqueness theorem (Theorem 1), which is presented without proof 
in this section. The proof will be given considerably later (see §20). 


THEOREM 1. Let 


t= f(t, x) (3) 


be a differential equation. We shall assume that the function f(é, x) is 
defined in a certain domain TI of the plane P of the variables ¢, x. We shall 
assume that the function f and its partial derivative of/dx are continuous 
in the entire domain I. The theorem asserts that 

(1) For every point (to, z9) of the domain I there exists a solution 
x = ¢(t) of equation (3) which satisfies the condition 


e(to) = 29; (4) 


(2) If two solutions r = ¢g(é) and x = X(t) of equation (3) coincide 
for one value t = ¢o, that is, if 


(to) = X(to), 


then these solutions are identically equal for all values of ¢ for which 
they are defined. 


The numbers fo, xo are called the inztial values for the solution x = ¢(t), 
the relation (4) represents the initial conditions for this solution, and we 
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1 a! 


——» t 


FIGURE 2 


shall also say that the solution x = ¢(t) satisfies the initial conditions (4) 
or that it has initzal values to, 2p. The assertion that the solution zt = ¢(t) 
satisfies the initial conditions (4) (or has initial values to, x9) assumes 
that the interval 7; < ¢ < rg, where the solution z = g(t) is defined, 
contains the point fo. 

Thus Theorem 1 asserts that the coordinates of any point (tg, x9) of the 
domain I are initial values for some solution of equation (3) and that two 
solutions with common initial values coincide. 

The geometrical meaning of Theorem 1 consists in the fact that through 
every point (to, Xo) of I’ passes one and only one integral curve of equation (3) 
(see Fig. 1). 

We have interpreted geometrically every solution z = y(t) of equation 
(3) in the form of the graph of the function y(t). We now give a geometric 
interpretation of equation (3) itself. Through every point (t, x) of T we 
shall draw a straight line J,,. with slope f(t, x). We obtain the direction 
field (or tangent field) corresponding to equation (3) and thus the geo- 
metric interpretation of this equation. 

The connection between the geometrical interpretation of the equation 
and the geometrical interpretation of its solutions consists in the fact 
(Fig. 2) that any integral curve x = ¢(t) is tangent to the straight line 
ligt) at each of its points (t, y(t). 


EXAMPLES 


1. To illustrate the significance of Theorem 1 (in this case, of its second 
part), we shall solve the differential equation 


z= az, (5) 
where a@ is a real number. Here 


f(t, x) = ax, 
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so that the function fin fact depends only on the variable x. The domain of 
f coincides with the entire plane P. Both the function f(t, 2) = ax and 
its derivative of(t,z)/dx = a@ are continuous functions of ¢ and z in the 
entire plane P. Thus Theorem | is applicable to equation (5). By direct 
substitution into equation (5) it is verified that each of the functions 


z= ce™, (6) 


where c is an arbitrary real number, is a solution of equation (5). We shall 
show that by assigning all possible values for c, we shall obtain all solu- 
tions of equation (5). Let zs = ¢(t) be an arbitrary solution of this equa- 
tion. We shall show that by proper choice of the number c we have 
o(t) = ce. Let to be a certain point of the interval of existence of the 
solution x = g(t), and let x9 = g(t). Let us assume that c = xe. 
Then the solutions x = g(t) and x = ce™* = ae*’—' of equation (5) 
have the same initial values (fo, 29), and therefore coincide by virtue of 
the second part of Theorem 1. Thus, formula (6) exhausts the set of all 
solutions of differential equation (5). 

2. We shall give a mathematical description of the process of decay of 
radioactive matter. The quantity of matter not yet decayed at the instant 
t we shall denote by z(t). Then the quantity of matter which has 
decayed over the small interval of time ¢ to ¢ + h is determined by the 
formula ehx(t), where a is a coefficient which depends on the properties 
of the radioactive matter and is slightly dependent on h; more accurately, 
it tends to a definite limit 8 as h — 0. Thus we have 


z(t) — a(t + h) = ahoit). 
Dividing this relation by h and passing to the limit as h — 0, we obtain 
4) = —fr(t). 


We see that the function x(é) satisfies the very simple differential equation 
examined in Example 1, so that 


z(t) = ce, 


To determine the constant c it is sufficient to specify any initial values. 
If, for example, it is known that at the instant ¢ = 0 there was a quantity 
of matter ro, then c = xo, and we have 


x(t) = aoe ™. 


The rate of decay is expressed here by the value 6 having the dimension 
1/sec or (sec) 1. Instead of the value 8, the rate of decay is often charac- 
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terized by the so-called half-life, ie., the time required for half of the 
existing matter to decay. We shall designate the half-life by T and establish 
the connection between the values 8 and 7. We have 


x _ 
3 = Loe ar . 
whence 
1 
T = =1n2. 
B 


2. Some elementary integration methods. The main problem facing us 
when we deal with a differential equation is the problem of finding its 
solutions. In the theory of differential equations, just as in algebra, the 
question of what it means to find the solution of an equation may be under- 
stood in various ways. In algebra the original aim was to find a general 
formula involving radicals for the solution of equations of any degree. 
Such were the formulae for the solution of a quadratic equation, Cardan’s 
formula for the solution of a cubic equation, and Ferrari’s formula for the 
solution of an equation of the fourth degree. Later, it was established that 
for equations of degree higher than the fourth, a general formula for solu- 
tion in radicals does not exist. The possibility remained of an approximate 
solution of equations with numerical coefficients and also the possibility 
of relating the dependence of the roots of an equation on its coefficients. 
The evolution of the concept of solution in the theory of differential] 
equations was approximately the same. The original aim was to solve, or, 
as it was said, “to integrate” differential equations by means of “quadra- 
tures,” i.e., the attempt was to write the solution in terms of the ele- 
mentary functions and their integrals. Later, when it became clear that a 
solution in this sense exists only for very few types of equations, main 
emphasis of the theory was transferred to the study of general laws of the 
behavior of solutions. In this section we shall develop integration methods 
by quadratures for certain first-order differential equations. 

(A) We shall solve the equation 


t= f(d, (1) 


the right-hand side of which depends only on the independent variable ¢. 
We shall assume that the function f(é) is defined and continuous on the 
interval r; < ¢ < ro. Under this assumption, equation (1) satisfies the 
conditions of Theorem 1, and the domain I for this equation is a strip in 
the tx-plane P which is determined by the inequalities ry < t < rg. Let 
to be an arbitrary point of the interval 7; < ¢t < rg; we assume 


eo = f f(r) dr. 


2) SOME ELEMENTARY INTEGRATION METHODS 7 


Figure 3 


The function go(t) is defined on the interval r; < i < rg. By inspection, 
an arbitrary solution of the equation (1) is given by the formula 


r= ot) = volt) + ¢, (2) 


where c is an arbitrary constant. The right-hand side of (2) is, as is known, 
the indefinite integral of the function f(t), so that (2) may be written in the 
form 


t= il f(t) dt. 


It is seen by direct inspection that the function (2) satisfies equation (1). 
Further, the graph of every solution (2) for an arbitrary c is obtained from 
the graph of the solution x = yo(¢) by using a vertical-parallel translation 
by the quantity c (Fig. 3). From this it is evident that through every 
point of T passes a curve defined by formula (2). Hence, by Theorem 1 it 
follows that (2) actually encompasses the set of all solutions of (1). 

(B) We shall solve the equation 


the right-hand side of which depends only on the unknown function z. 
We shall assume that the function g(z) is defined and has a continuous de- 
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rivative on the interval ay < x < ag. Then Theorem 1 is applicable to 
equation (1), and a strip in the tz-plane P which is determined by the 
inequalities a; < x < ag serves as the domain I. For the sake of sim- 
plicity, we assume in addition that on the interval a4, < x < ay the func- 
tion g(z) does not vanish and consequently does not change sign. Let zo 
be an arbitrary point of the interval aj < x < a2; we assume 


_ fa. 
Go(z) = [ g(é) (4) 


The function G(x) is defined on the interval a, < x < ag, and its deriva- 
tive on this interval is never zero; therefore the function Go(x) has an 
inverse, i.e., there exists a function ¥(t) such that 


Go(Wo(t)) = t. (5) 
Consequently, an arbitrary solution of equation (3) is given by the formula 
z= ¥(t) = volt — °), (6) 


where c is an arbitrary constant. The function y(t) is monotonic and 
assumes all values belonging to the interval ay < x < dg. 

We shall first prove that the function (6) is a solution of equation (3). 
From (5) it follows that 


Go¥®) = Go(volt — c)) =t—e. (7) 


Differentiating this relation with respect to t, we obtain 


o¥OW® = 1, 
hence [see (4)] 
v(t) = gv). 


Since the function Wo(¢) is obtained as the inverse of the monotonic func- 
tion Go(x), which is defined on the entire interval ay < x < ag, the func- 
tion Wo(t) [and consequently y(£)}] is monotonic and assumes all values 
on the interval ay < x < dy». Since, further, the integral curve (6) is ob- 
tained from the curve zp = yo(t) by a horizontal-parallel translation 
(Fig. 4), a curve of the form (6) passes through every point of the strip I. 
Thus by Theorem 1, (6) contains the set of all solutions of equation (3). 
Note. The relation (7) shows that the function y(t) is the inverse of the 
function Go(x) + c, which is the indefinite integral of the function 1/g(z). 
Thus all solutions x = y(t) of equation (3) are described by the formula 


dx 
[= 6) 
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Figure 4 


If the function { = G@o(x) + c is taken as the unknown function, then we 
obtain for it the differential equation 


de 
dx” g(x)’ 


which is equivalent to equation (3). It is solved by the method presented 
in (A), which gives (8). 
(C) We shall solve the equation 


& = fdg(), (9) 


which is called an equation with separable variables. We shall assume that 
the function f(t) is defined and continuous on the interval r; < t < ro 
and that the function g(x) is defined and has a continuous derivative on the 
interval ay < x < ag. Then Theorem | is applicable to equation (9), 
and the rectangle determined by the inequalities 


11) <t <1, a, <x < Ge 


serves as its domain I. For the sake of simplicity, we shall assume that 
g(x) does not vanish on the interval ay < x < ag. For the solution of (9) 
we form two auxiliary equations: 


mm — §(0, (10) 
a = (2). (1) 


Equations (10) and (11) are solved by the rules given in (A) and (B). Let 
u = ¢o(t) be some solution of equation (10) and x = Wo(u) some solution 
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of equation (11) defined on by < u < be and assuming all values belong- 
ing to the interval aj < x < ag [see (B)]. Hence an arbitrary solution 
of equation (9) can be written in the form 


t= X(t) = Wo(vo(t) — ¢), (12) 


where c is an arbitrary constant. The interval of variation of ¢ for every 
solution (12) must be such that values of u = ¢go(t) — ¢ belong to the 
interval b} < u < de. 

We shall prove first that the function (12) is a solution of (9). Since the 
function Wo(u) satisfies (11), ¥o(vo(t) — c) = glvo(vo(t) — ¢)] = g(x(t)). 
Further, o(t) = f(é [see (10)]. Differentiating equation (12), we obtain 


X() = Wolvo(t) — eloolt) = g(x. 


Thus the function + = X(Z) satisfies (9). 

We shall now show that formula (12) gives all solutions of equation (9). 
Let (to, %o) be an arbitrary point of T. Since the function + = ypo(u) 
assumes all values on the interval ay < r < dg, there exists a value ug 
such that Yo(uo) = ro. Let us set c = go(fg9) — uo; then by (12) we 
have X(f9) = Wo(uo) = Xo. Thus through every point (fo, 19) of I’ passes 
a curve of the form (12) and, by Theorem 1, formula (12) contains all 
solutions of equation (9). 

Note. Vormula (12) may be written in the form 


dx 
ice 7 [ 10 a 
[compare (A) and (B)]. 


(D) We shall solve the equation 
1 a, 
yah (3) (13) 


in which the right-hand side depends only on the ratio of the variables y 
and t. Such an equation is called homogeneous. We shall assume that h(x) is 
defined and has a continuous derivative on the interval a4, < x < ay. 
Then Theorem 1 is applicable to equation (13), and I consists of all points 
of the ty-plane P satisfying the inequality 


a<t<a (0). 


Tor the sake of simplicity, we also assume that on the intervala; < zx < de 
the function h(x) — x does not vanish. We shall solve (13) by making a 
change of variables, that is, instead of the unknown function y we shall 
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introduce an unknown function x by setting 
y = ct. (14) 


By replacing the function y in equation (13) by its expression (14) we 
obtain for the new unknown z the equation 


dt +2 = A(x), 


or, what is the same thing, 


This equation is an equation with separable variables and may be solved 
by the method shown in (C). 
(E) We shall solve the equation 


y = altjy + bit). (15) 


This equation is called Iznear, because the unknown function and its 
derivative appear linearly in it. We shall assume that the functions a(t) 
and b(é) are defined and continuous on the interval r; < t < reg. Then 
for equation (15) Theorem 1 is applicable, and the domain I is defined by 
the inequalities 


ry <b <r. 


If b(é) = 0, then equation (15) is called homogeneous. 
To the nonhomogeneous equation (15) corresponds the homogeneous 


equation 
t= az, (16) 


which we shall study first. Equation (16) is an equation with separable 
variables and it can be solved by the method shown in (C). However, 
the function g(x) in the given equation is equal to x and can vanish. 
Therefore, for the solution of equation (16) it is necessary to investigate 
separately the domains z > 0 and x < 0 and also the solution + = 0. 
Instead, we shall write the solution directly by means of the formula 


x = cexp Lf a(r) dr|; (17) 


where r; < to < rg andc is an arbitrary constant. Substitution of (17) 
into (16) shows that (17) gives a solution; we shall show that it contains 
the set of all solutions. Let (9, 29) be an arbitrary point of the strip 
ry < ¢ < rg. In order that the solution (17) have the initial values (Jo, xo) 
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it is sufficient that the constant ¢ satisfy the condition 
do 
Xp = cexp Lf a(T) dr|) 
to 


from which ¢ is uniquely determined. 
The nonhomogeneous equation (15) is solved by the method of variation 
of parameters. We sct 


y = cexp [fam dr] (18) 


and assume that cis not a constant quantity but depends on ¢. Substitution 
of (18) into (15) gives 
t 
ée Tt) dr| = bid). 19 
xp| fi a(r) dr] = b@) (19) 
Thus 


é = b() exp [-/ a(r) dr]. 


This differential equation for c is solved by the method presented in (A). 

The method of variation of parameters is in this case a method of 
introduction of anew unknown. That is, instead of the unknown function y, 
we introduce, by (18), a new unknown function c. By finding an arbitrary 
solution of (19), we also find an arbitrary solution of (15) by formula (18). 


EXAMPLES 


1. We shall solve the equation 


. 2 
t=p 7° (20) 
Here, as in (A), the right-hand side, 
2 
0 =p’ 


depends only on the independent variable ¢, but it has discontinuities at 
the points é = landt = —1. 

Thus the domain I in the ¢tz-plane P consists of a strip, —1 < t < 1, 
and two half-planes,  < —landt > 1. In order to solve (20) by the 
rules indicated in (A), it is necessary to divide the interval —o0 <t<-+oa 
into three intervals, —»2 <t< —1,—l1 <t<1,1<t< +0, and 
in each of them to take the indefinite integral of f(t). To carry out the 
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Figure 5 


integration, we expand the function f(1) into partial fractions: 
a 
~t—-1 t4+1 


For each of the three intervals, the solution of equation (20) is written in 
the form 


1 1 
s=f{(44-ADe 


In [é — 1] —In|[t+ 1] +e 


I 


= In 


real te 21) 
As was indicated in §1, a continuous function which satisfies the equation 
and is defined on the interval is called a solution of the differential equation; 
thus formula (21) for a fixed c defines not one but three solutions of 
equation (20), the first of which is defined on the interval —x <t< —l, 
the second on the interval —1 < ¢ < 1, and the third on the interval 
1<t< om (Vig. 5). 
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2. We shall solve the equation 
g=—-y>—: (22) 


The right-hand side here depends only on 2; it is defined, continuous, and 
differentiable for all values of x, and for this reason the domain I, for equa- 
tion (22), coincides with the entire plane P. The function 


2 
zw — 1 
g(x) —_ 2 ’ 


on the right-hand side of equation (22), vanishes at the points z = —1 
and x = 1; for the solution of equation (22) according to the rules in- 
dicated in (B), therefore, it is necessary to split T into the three domains 
determined by the inequalities 


—-o <x<—-l, -l<a<+H, LlL<2x<ow, — (23) 


and in addition to investigate the obvious solutions x = 1, x =+1. In 
each of the domains (23), the solution x = y(t) is determined implicitly 


from the equation 
2dx 
17% 


x— 1 


x+1 


This equation is equivalent to the equation 


or else from the equation 


ln 


bate. 


= Co2é ‘ (24) 


where ¢2 is an arbitrary positive constant. On the intervals —0% <2 < 
—landl < x < o equation (24) may be written in the form 


ea1_ oye 
xr +] — 20 5 
and on the interval —1 < x < 1 in the form 
er d_ age. 
z+l- wo 
both cases can be embraced by a single formula 
oe ce’, (25) 


zc+l 
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where ¢ is an arbitrary constant other than zero. Solving equation (25) 
for «, we have 

_ L+ee! 
~ T= eet’ 


x (26) 


where c ~ 0. 

For a fixed negative c formula (26) gives one solution, located in the strip 
~1 <2 <1. Fora fixed positive ¢ formula (26) defines two solutions, 
of which one, defined on the interval —«2 < t < — Ine, is located in the 
half-plane x > 1, and the other, defined on the interval —Ine < t < », 
is located in the half-plane x < —1. In addition to the solutions located in 
the domains (23), equation (22) has two more solutions, x = landz = —1, 
which are obtained formally from (26) for ¢ = 0 and c = o. Since one 
of the solutions found (I‘ig. 6) passes through every point of P, it follows 
from Theorem | that other solutions do not exist. Thus formula (26) 
includes the set of all solutions of equation (22) if c is allowed to take all 
real values, including a. 

3. Example 2 is a typical one for an equation of the form 


& = g(z), (27) 


where g(x) is a continuously differentiable function defined for all values z, 
—« <x < o. Theorem 1 is applicable to equation (27), [ being the 
entire plane 7. We note first that, if a is a zero of the function g(z), that is, 
if g(a) = 0, then the function z = a is a solution of equation (27). Thus 
to every zero of g(x) corresponds a solution whose graph is a horizontal 
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straight line 2 = a in the plane P. For the sake of simplicity, we shall 
assume that the zeros of g(x) do not have accumulation points. Then P 
will be decomposed by solutions of the form z = a into a series of strips, 
including possibly one or two half-planes of the form x < a’ andz > a”. 
In each of these strips equation (27) may be solved by the method in- 
dicated in (B). 

We shall note certain essential properties of these solutions. Let a; 
and az, a, < G2, be two successive zeros of g(x), so that on the interval 
a, <x < ag the function g(x) does not vanish. Let x = o(t) be any 
solution of (27) in the domain a, < x < ag. As was noted in (B), all 
other solutions in this domain are obtained by the formula x = yo(t — c), 
i.e., by a horizontal-parallel translation of the solution W(t). Since on the 
interval a; < x < de the function g(x) does not vanish, the solution 
Wo(t) is monotonic. In order to be definite, we shall assume that g(x) > 0 
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for a, < x < dg. Then the solution yo(t) is a monotonically increasing 
function. I'rom Theorem 1 it is comparatively easy to derive the fact that 
the solution W(t) can be continued to the interval —a < i < o, whence 


lim o(t) = a1, lim po(t) = ae. 
ti—a t++o 


We shall now describe the behavior of the solutions in the domain x > a, 
where a is the maximum zero of the function g(x) (if such a value exists). 
It is obvious that g(x) # Ofor x > a; to be precise, we shall assume that 
g(x) > Ofor x > a. Let Yo(t) be some solution of (27) passing into the 
domain x > a. By virtue of (B) all solutions in this domain are given by 
the formula x = Wo(t — c). From Theorem | it is comparatively casy to 
derive the fact that the solution y9(£) can be continued over the interval 
—o <t <r, whence 


lim ¥o(t) = a, — lim Yo(t) = +20; 
t+—2 tor 


here r may assume finite values or the value +o. Thus in the half-plane 
x > a two distinct cases are possible: every solution is defined on the 
interval —w#2 < t < +o or every solution is defined on the interval 
—# <t <r, where the number r is finite and depends on the solution 
selected. Since a solution of the form described above passes through 
every point of the plane P (J'ig. 7), the solutions described above consti- 
tute, by Theorem 1, the set of all solutions of equation (27). 

4, We shall show that, if the right-hand side of the equation does not 
have a continuous derivative, then the second part of Theorem 1 (unique- 
ness) need not apply. Let us consider the equation 


& = 37/3, (28) 


The right-hand side of equation (28) is defined and continuous for all values 
of x, but its derivative 27/3 has a discontinuity at the point = 0. If 
the set of all points for which x ~ 0 is taken as the domain I, then Theo- 
rem 1 is applicable to equation (28) in this domain, and in each of the 
half-planes « > 0, x < 0, equation (28) can be solved by the method 
indicated in (B). Solving equation (28) by this method, we obtain 


V3 ¢—¢, (29) 


Part of the graph of the function (29) (for £ < c) isin the half-plane x < 0 
and part (fort > c) isin the half-planez > 0. It is immediately verified, 
however, that the function (29) [that is, x = (t — c)*] isa solution of (28) 
for all values of ¢ in the interval —o < t< +o. At the same time, 
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x = Ois also a solution of equation (28). Thus through every point z = 0, 
t = cof the straight line x = 0 pass two solutions (I'ig. 8): the solution (29) 
and the solution x = 0. We sce that the second part of Theorem 1 (unique- 
ness) does not hold for equation (28). 


3. Formulation of the existence and uniqueness theorem. In §1 we 
studied a first-order differential equation and formulated an existence and 
uniqueness theorem for this equation. The theory of ordinary differential 
equations deals also with more general systems of equations. Usually a 
system of ordinary differential equations consists of as many equations as 
there are unknown functions in the system; in this connection, all the un- 
known functions are functions of one and the same independent variable. 
In all cases the existence and uniqueness theorem is the fundamental 
theoretical proposition which makes it possible to approach the study of a 
given system of differential equations. 

The existence and uniqueness theorem is formulated and proved for a 
system of equations which superficially appears to be of a rather particular 
type. But in fact, systems of comparatively general type reduce to this 
system of equations. Systems of differential equations of the particular 
type discussed here we shall in the future call normal, although this term 
is by no means generally accepted. 
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The system 
a = fi(t,c',2?,..., 2"), t= 1,...,7, (1) 


of ordinary differential equations is called a normal system. In this system 
t is the independent variable, x!,..., x2” are the unknown functions, and 
fi,...,f" are functions of n + 1 variables defined in a certain domain T 
of the (n + 1)-dimensional space of the variables t,x!,...,2". Here 
and in the sequel, a “domain” of cuclidean space is understood to be an 
open set, i.c., a set which together with every point contains a certain 
sphere of positive radius with its center at this point. It will always be 
assumed that the functions 


fit, v1,27,...,2"), t=1,...,n, (2) 


are continuous in the domain I; it will also be assumed that their partial 
derivatives 


afi(t, x1, 27,..., 2") 


5 > ag=de..yn, (3) 


are continuous in T. It should be noted that the partial derivatives (3), 
the continuity of which is assumed, are taken only with respect to 
z',...,2#7, but not with respect to the independent variable t. 

In the presently fashionable theory of ordinary differential equations, 
the continuity of the derivatives (3) is not usually required, and this 
requirement is replaced by a weaker one, namely a Lipschitz condition for 
the functions (2). I consider that this generalization is nonessential and 
there is no need to dwell on it (this condition will be formulated below). 

By a solution of system (1) we shall mean a system of continuous func- 
tions 

v—gi(t), iti=]1,...,n, (4) 


which are defined on some interval r; < t < re and which satisfy (1). 
The interval r; < t < re is called the interval of definition of the solution 
(4) (the cases rr} = —x%, rg = +0 are not excluded). It is assumed that 
the system of functions (4) satisfies the system of equations (1) if after 
substitution of the functions (4) into (1) in place of z!,..., 2”, the rela- 
tions (1) reduce to identities in ¢ on the entire interval r; < t < rg. For 
this substitution to be possible, it is necessary that the functions (4) have 
derivatives at every point of the interval r; < ¢ < rg and that the right- 
hand sides of (1) be defined for all values of their arguments. Thus the 
point with coordinates 
t,el(t),..., e(t) 


must belong to [ for all values of ¢ in the interval ry < t < rg. We shall 
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now give a formulation of the existence and uniqueness theorem for the 
normal system (1). 


THEOREM 2. Let (1) be a normal system of ordinary differential equa- 
tions. Here the right-hand sides of equations (1) are defined in a certain 
domain I, and the functions (2) and (3) are continuous in this domain. 
Then, for every point 

to, x, x, coe XO (5) 


of I, there exists a solution 
vi=e¢i(), t=1,...,2, (6) 


of the system (1), which is defined on some interval containing the point 
to and which satisfies the conditions 


¢'(to) = 26, i= 1,...,7 (7) 
Further, if there exist two solutions 
x = y*(0), t#=1,...,27, 


xt = x*(t), a 


(8) 


I 
com 
3 


of (1) which satisfy the conditions 


¥'(to) = x*(to) = x6, t= 1,...,7, (9) 


each defined on an interval of values of ¢ containing to, then these solu- 
tions coincide wherever both are defined. 


The values (5) are called initial values for the solutions (6) and (8), and 
(7) and (9) are called inztial condittons for these solutions. We shall say 
hereafter that our solutions have the initial values (5) or satisfy the initial 
conditions (7) and (9). Thus the existence and uniqueness theorem for a 
normal system can be formulated briefly as follows: 


For any initial values (5) there always exists a solution of system (1) 
with these initial values, which is defined on a certain interval containing 
the point ¢. Further, if there are two solutions with identical initial 
values (5), each of which is defined on a proper interval containing fo, 
then these solutions coincide on the common part of these intervals. 


The existence and uniqueness theorem permits us to formulate and solve 
the problem of the maximum interval of existence of a solution with given 
initial values. In order to attack this problem we shall assume that we 
have two solutions with the same initial values (5), the first solution being 
defined on the interval r,; < ¢ < re and the second solution on the interval 


3] THE EXISTENCE AND UNIQUENESS THEOREM 21 


8, < ¢< 8s». Both these intervals contain the point fp, that is, they in- 
tersect. If one of the intervals were contained in the other, for example, 
the first one in the second, then consideration of the first would be super- 
fluous, since, by virtue of uniqueness, the first solution coincides in this 
case with the second wherever it is defined. It can happen, however, that 
neither of the two intervals is contained in the other; for example, it may be 
thatr; < sy < re < 8». In this case, the uniqueness implies that the two 
solutions coincide on the interval s; < ¢ < rg; the first solution is defined 
only on the half-interval r; < t < s; and the second only on the half- 
interval re < t < sg. On the interval r; < ¢ < se, which contains both 
original intervals, no solution has as yet been formally defined. This solu- 
tion, however, is very easy to define. It is only necessary to consider 
that on the interval r; < ¢ < re this solution coincides with the first one, 
and on the interval s; < ¢ < s» it coincides with the second. Thus we 
arrive at a solution defined on the interval r; < ¢ < s_ which contains 
both of the original intervals, and there is no need to investigate the 
original solutions which are defined on the subintervals of the interval 
ry <t < 8. It is evident that if there is a finite number of solutions with 
common initia] values, then it is possible to construct the solution on the 
interval which contains all the intervals of existence of the original solu- 
tions. Also, all solutions with common initial values can be easily con- 
solidated into one, and a concept of the maximum interval of existence of 
a solution with given initial values can be attained. We shall formulate 
and prove the corresponding proposition: 
(A) For any initial values (5), there exists a solution 


z= g(t), «t= 1,...,2n, 


of the system (1), defined on a certain interval m, < ¢ < mg, such that 
for any solution x = ¢'(t),i = 1,...,n, of system (1) with initial values 
(5), its interval of definition 1s contained in the interval m, < t < mg. 
By virtue of the uniqueness theorem, the solution z' = ¢'(t) coincides 
with the solution x‘ = $*(t) along its entire interval of definition, and 
for this reason there is no need to consider it. The interval m; < t < me 
will be called the maximum interval of existence for the initial values (5). 

We shall prove the existence of a maximum interval for given initial 
values (5). To each solution of system (1) with initial values (5) corre- 
sponds its own interval of definition. The set of all right-hand endpoints 
of these intervals we shall denote by Re and the set of all left-hand end- 
points by R,. The upper bound of the set R2 we shall denote by me (in 
particular, mz may be +), and the lower bound of the set R, we shall 
denote by m, (in particular, m,; may be —o). We shall now show that 
m, < § < mz is a maximum interval for the initial values (5). We shall 
first construct the solution x' = ¢*(t), 7 = 1,..., 7, with initial values 
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(5) which is defined on the interval m; < t < me. Let t* be an arbitrary 
point of this interval, that is, m,; < t* < mg. To be precise we assume 
that {* > to. Since m2 is an upper bound of the set Re, there exists a 
solution x* = y(t), = 1,..., 7, of (1) with initial values (5), whose in- 
terval of definition r; < t < r2 contains the point ¢*, and we set ¢*(t*) = 
yi(t*), i= 1,...,n”. The value obtained for the solution ¢‘(t) at the 
point ¢ = ¢* does not depend on the choice of the solution x‘ = y(t), 
i= 1,...,n. Indeed, if instead of the solution z' = y(),7 = 1,..., n, 
we took 2? = x*(t), 4 = 1,..., 7, with initial values (5) and interval of 
definition s; < t < 82 which also contains the point t*, then by virtue of the 
uniqueness (see Theorem 2) we would have y‘(é*) = x*(t*),¢ = 1,..., 7. 
Thus the functions ¢'(t), 7 = 1,..., 7, are uniquely defined on the entire 
interval m; << my. At the same time they constitute a solution of 
(1) with initial values (5). Indeed, near each point ¢* of the interval 
m, < t < mothe system of functions ¢*(t), 7 = 1,...,n, coincides by 
construction with some solution of (1), and for this reason the functions 
g(t), i= 1,..., , are continuous and constitute a solution of (1). It 
remains to be shown that the interval m; < t < m2 is maximal. Let 
z’ = gi(t),i = 1,...,7, be a certain solution of system (1) with initial 
values (5) which is defined on the interval r; < ¢ < re. Then ry is an 
element of the set R, and rg is an element of the set Re, and therefore 
m, < 11, T2 < Mo, that is, the interval ry < ¢ < rg is contained in the 
interval my < t < mo. 

We shall formulate here without proof one more existence theorem, 
which we shall later obtain as a simple corollary of Theorem 2. 


THEOREM 3. Let 


f= > ai(t)z? + dt), @—=1,...,0 (10) 


j=1 
be a normal linear system of equations. Here the coefficients ai(t) and 


the free terms b*(t) are continuous functions of ¢ defined on a certain 
interval gq; < ¢ < qe. Then for arbitrary initial values 


to, XO, x6, oe , XO, q1 < to < q1) (11) 


there exists a solution of (10) with these initial values which is defined 
on the entire interval gq, < t < qe. 


In other words, the maximum interval of existence of the solution of 
the linear system (10) is the entire interval q; < t < qo [for the arbitrary 
initial conditions (11)]. 

For the case when the coefficients and free terms of system (10) are de- 
fined on the entire straight line, i.e., when qi = —o, Gg = +o for any 
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initial values, there exists a solution of (10) defined on the entire infinite 
interval —o < t < +o. 

The solutions of a normal system (1) can be interpreted geometrically 
as integral curves in the (n+ 1)-dimensional space with coordinates 
t, z!,..., 2" (compare §1). The equations of an integral curve have the 
form 

z= eit), i=1,...,n, (12) 


where (12) is a solution of the system. 
The system (1) itself may be interpreted with the aid of the direction 
field in an (n + 1)-dimensional space (compare §1). 


EXAMPLES 
1. We shal! solve the normal linear system of equations 
z= —oy, y =r. (13) 


The domain I for this system is the entire space (é, x, y). By direct inspec- 
tion, we see that the system of functions 


x = c, cos (wt + Co), y = c,sin (wt + co), (14) 


where c, and Cz are arbitrary constants, is a solution of (13). To show that, 
by proper selection of the constants c, and cs, an arbitrary solution can be 
obtained from formula (14), we shall prescribe initial values to, 29, yo and 
show that among the solutions (14) there is a solution with these initial 
values. We obtain for the constants c, and c, the conditions 


C1 COS (wtp + C2) = Xo, c, sin (wlo + co) = Yo. (15) 
Let p and ¢ be polar coordinates of the point (29, yo), so that 
Lo = pcosg, Yo = psing¢. 
Then equations (15) may be rewritten in the form 
C1 cos (wlo + C2) = pcos ¢, C1 Sin (wlo + co) = psing. 


By setting 
ci = Pp, C2 = ¢ — aly, 


it is evident that we shall satisfy conditions (15). Thus, through every 
point (to, Yo, yo) passes a solution given by formula (14). By virtue of 
Theorem 2 (uniqueness), formula (14) contains the set of all solutions. 
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2. Let us solve the equation 

z= x’ cost (16) 

with separable variables. The domain I for this equation is the entire 


plane (t,z). For z > 0 and for x < 0 this equation can be solved by the 
method presented in (C) of §2. For each of these half-planes we have 


de i cos t dt, 
& 


= sint — c. 


or 


| 
a 


Thus we obtain 


1 
~ ¢—sint (17) 
In addition to the solutions described by this formula, we have the trivial 


solution 
z= 0. (18) 


We shall show that formulas (17) and (18) include the set of all solutions 
of equation (16). Let (fo, x9) be arbitrary initial values. If x9 = 0, then 
the solution (18) has these initial values. If ro ~ 0, however, then the 
constant c has the value 

c= sin to + I ‘ 

B20) 

The solution (18) is defined on the interval (— 0,00), and this interval 
is the maximum interval of existence of the solution for the corresponding 
initial values. In exactly the same way, for |c| > 1 formula (17) deter- 
mines one solution defined on the interval (— 00, 00), and this interval is 
maximal. For a fixed constant c which satisfies the inequality |e] < 1, 
formula (17) gives not one solution, but an infinite set of solutions. Each 
individual solution in this case is defined on the interval r) < t < ro, 
where r; and rg are two consecutive zeros of the function sin? — c. This 
interval is the maximum interval of existence of the solution for the 
corresponding initial values, for, as ¢ approaches the endpoints of the 
interval, the function (17) tends to infinity (T'ig. 9). 

3. We shall show that if the right-hand sides (2) of the system of equa- 
tions (1) are & times continuously differentiable, i.e., have continuous 
derivatives of the kth order (including derivatives of mixed type) with 
respect to all variables ¢, z!,..., 2x”, then the (k + 1)st derivative of the 
solution (4) of (1) exists and is continuous. 
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Indeed, for (4), we have the identity 
e(t) = fit, e(),..., (0), t=1,...,n7. (19) 


If the right-hand sides (2) have continuous first derivatives, then the right- 
hand side of (19) has a continuous derivative with respect to ¢, so that 
g(t) exists and is continuous. By differentiating (19) k times, we establish 
(in order) the existence and continuity of all derivatives of the 2nd, 
3rd,..., (& + 1)st order of the functions ¢‘(t). 


4, Reduction of a general system of differential equations to a normal 
system. In the preceding section we formulated an existence and unique- 
ness theorem for a normal system of differential equations. Here it will 
be shown how quite general systems of differential equations may be 
reduced to normal systems, and at the same time an existence and unique- 
ness theorem will be established for these general systems. 
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We shall first give the concept of a system of differential equations in 
the general form. 

In the case of one unknown function x of an independent variable ¢ it is 
customary to consider one equation, which can be written in the form 


F(t,2,¢,...,2°”) = 0. (1) 


Here ¢ is an independent variable, x is the unknown function, and F is a 
given function of n + 2 variables. The function F need not be defined 
for all values of its arguments; for this reason we speak of the domain of 
definition B of the function F. Here we have in mind the (n + 2)-dimen- 
sional domain B in which the coordinates of a point are the variables 
t, 2, ¢, ..., 2°”. If the highest-order derivative entering the differential 
equation is equal to n, then we speak of an nth-order equation. By a 
solution of equation (1) we shall mean a continuous function x = ¢(t) of 
the variable ¢, which is defined on a certain interval r; < 4 < re, such 
that substitution of ¢(é) for 2 in (1) reduces (1) to an identity in ¢ on the 
interval 7; < ¢ < ro. It is clear that substitution of x = ¢(t) in (1) is 
possible only when ¢(t) has derivatives up to the nth order, inclusive, on 
the entire interval of existence ry; < { < rg. For the substitution of 
x = ¢(t) into relation (1) to be possible, it is also necessary that the point 
with coordinates {£, y(t), o(t),..., ¢°()} belong to the domain B of F 
for any ¢ in the interval r; < t < ro. 

If there are two unknown functions of one independent variable, then 
two differential equations are considered which together form a system 
of equations. This system can be written in the form 


F(t, x, £, sty a”, y, y, sty ™) = 0, (2) 
Git, a, &, sey a™, UD y, sey y™) = 0. 


Here ¢ is an independent variable, + and y are the two unknown functions, 
and F and G are two functions, each of m + n + 8 variables, which are 
defined in a certain domain B. If the highest-order derivative of x entering 
into (2) is equal to m, and the highest-order derivative of y entering into 
(2) is equal to n, then the number m is called the order of the system (2) 
with respect to x, the number n is the order of the system (2) with respect 
to y, and the number m + nis called the order of the system (2). A pair of 
continuous functions t = g(t) and y = w(é) defined on a certain interval 
ry < t < re will be called a solution of (2) if they have the property that 
substituting them into (2) reduces (2) to an identity in ¢ on the entire 
interval r; < ¢ < rg. As in the case of one equation, it is assumed that 
conditions are satisfied which make possible the substitution of « = ¢(t), 


y = ye) into (2). 
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Systems of differential equations with three or more unknown functions 
of one independent variable are defined similarly. If the unknown func- 
tions of a system of differential equations are the functions z',... , x”, and 
the highest-order derivative of x‘ entcring the system is q;,i = 1,...,%, 
then the number gq; is called the order of the system with respect to x‘, and 
the number ¢g = q; + q2 +°+-- + qa is called the order of the system. 
Thus, the normal system (1) of §3 is of order n. 

If (1) can be solved for x”, then equation (1) may be rewritten in the 


form 
a™ — fit,a,é,...,2"7?), (3) 


In exactly the same way, if (2) can be solved for x” and y, then this 
system can be rewritten in the form 


(m—1 


(m) . ) (a—-l) 
om = f(t,z,4,...,2 re ), 


(n) (4) 


y™ = g(t, 2, 4,..., 2% 


» y, y, Leey y*—), 

Equation (3) and system (4) are said to be solved for the highest derivatives. 
Systems with an arbitrary number of functions which can be solved for the 
highest derivatives are defined analogously. In particular, the normal 
system (1) of §3 can be solved for the highest derivatives. Later we shall 
concern ourselves almost exclusively with systems which can be solved 
for the highest derivatives. 

We shall now show that any nth-order system of differential equations 
which is solved for the highest derivatives may be reduced to a normal 
nth-order system. To begin, we shall show how one nth-order equation is 
reduced to an nth-order normal system. 


(A) Let (n) . m—-1) ind 
yy" =fGwH--sy ) (5) 


be an nth-order differential equation which is solved for the highest 
derivative. Here ¢ is an independent variable and y is an unknown function 
of the variable t. Further, f(t, y, y,..., y\) isa given function of n + 1 
variables t, y, y,..., y, which is defined in a certain domain I of an 
(n + 1)-dimensional space. With respect to the function 


IL --,y¥”) 


J 


we shall assume that it is continuous in I and that its partial derivatives 


< (n—1) 
etd ), k=0,1,...,n —1, 


(where it is assumed that y = y) are also continuous in T. To change 
equation (5) into a normal system of equations, new unknown functions 
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zi, 27, ..., 2" of the independent variable ¢ are introduced by means of 
the equalities 


ri= y, r=y,..., zm = yy), (6) 


1 2 
I 
#2 = 23, 
(7) 
ant = x” 
d 
z” = f(t,z!,x7,..., 2”). 
Theorem 2 implies that for every point to, Yo, Yo, .--, ys" » of T there 


exists a solution y = y(t) of equation (5) which satisfies the initial condi- 
tions 
(to) = y®, k=0,1,...,n— 1, 


or, as it is called, a solution with initial conditions 


to, Yo) Yo, say yg. (8) 


Further, any two solutions with initial values (8) coincide on the common 
part of their intervals of definition. If equation (5) is linear, i.e., if f is 
linear with respect to the variables y, y,..., y‘"», and if its coefficients 
are defined and continuous on the interval gq; < t < qo, then for any 
initial values to, yo, Yo,---, ys », where qi < t < qo, a solution 
y = w(é) exists which is defined on the entire interval gq, < ¢ < qo. 


We shall prove that equation (5) is equivalent to the system (7). We 


shall assume that y satisfies (5) and prove that the functions x',..., x” 
defined by (6) satisfy (7). By differentiating the relations (6) which intro- 
duce the new unknown functions z}, ..., x”, we obtain 
geay™, k=1,...,n—1, (9) 
= y™. (10) 


Replacing the right-hand sides of (9) on the basis of (6) and the right-hand 
side of (10) on the basis of (5), which is satisfied by y, we obtain (7). Let 
us assume, conversely, that the functions x!,...,2” satisfy (7); we 
shall then take x! for y and show that y satisfies (5). Setting 2! = y in 
the first of the equations of (7), we obtain x? = y. Substituting y for x” 
in the second equation of (7), we obtain x? = y. Continuing this con- 
struction further, we arrive at the relations (6). Finally, substituting in 
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the last of the equations of system (7) each function z!, ..., x” by those 
of (6), we obtain equation (5) for y. 

Since the function f is defined in I’, the right-hand sides of (7) are also de- 
fined in T under the condition that the change of coordinates obeys (6). For 
system (7) the conditions of Theorem 2 are satisfied in T. Thus it is 


possible to select arbitrarily the initial values fo, zo, x@,..., 2% in I. 
This set of initial values is transformed by (6) into a set of initial values 
t, Yo. Yo) -» +, Yo» for equation (5). 


If (5) is linear, then (7) is also linear. From this follows the final part 
of (A), by Theorem 3. Thus proposition (A) has been proved. 

The method described in (A) makes it possible to reduce to a normal 
system any system of differential equations which is solved for the highest 
derivatives. In order not to encumber the presentation with formulas, 
we shall investigate in the following proposition (B) a fourth-order system 
consisting of two equations. 

(B) Let 

a = f(t, U, u, v, v), 


(11) 


B= g(t, u, &, », 0) 


be a system of two second-order equations. Here ¢ is an independent 
variable, and u and v are the unknown functions. We shall reduce (11) toa 
normal system by introducing new unknowns z!, x”, x?, x* according to 


the formulas 


By this substitution, (11) is transformed into the system 


zl _— x, 

. 3 

# = tt, x}, x’, x } x), (12) 
3d 

£3 = zt, 

z* = g(t, x}, x7, x3, x*). 


If it is assumed that the functions f and g, on the right-hand sides of (11), 
are defined in a domain I of a five-dimensional space, where ¢, u, %, v, 0 are 
the coordinates of a point, these functions being continuous and having con- 
tinuous first-order partial derivatives with respect to u, u, v, 3, then the 
system (12) is normal and satisfies the conditions of Theorem 2 in I. 
From this it easily follows that for any point to, Uo, Uo, Yo, Jo of I there 
exists a solution u = g(t), v = y(t) of (11) which satisfies the initial 
conditions 
e(to) = Uo, P(to) = to, 


¥(to) = vo, ¥(to) = do. 


30 INTRODUCTION [cHaP. 1 


In addition, all solutions with identical initial conditions coincide on the 
common part of their intervals of existence. 
The proof of (B) is carried out in the same manner as the proof of (A). 
If one nth-order equation, given in the form 


F(t, y; y sso y y™) = 0, (13) 


is not solved for the highest-order derivative y‘” of the unknown, then 
the question immediately arises of its solvability with respect toy. We 
may assume that this question does not relate to the field of differential 
equations, but rather to the theory of functions. There arise, however, 
some questions, which are investigated in the theory of differential equa- 
tions, of the following character. Let us assume that equation (13) is 
quadratic with respect to the variable y‘”. Then it defines a two-valued 
function y™ of the remaining variables. Where the two values are really 
distinct, we actually arrive at two different equations of the form (5), but 
where the two values of variable y” defined by equation (13) coincide, 
splitting them into two equations of form (5) is impossible, and it is neces- 
sary to study equation (13). The study of such equations leads to the 
concept of singular solutions of the differential equation and to the study 
of equations on surfaces. These questions, however, will not be studied 
in this book. 


EXAMPLES 
1. We shall solve the equation 
E+w?x = 0, w= const. (14) 
By direct inspection we see that the function 


x = rcos (wt + a), r > 0, (18) 


where r and @ are constant, satisfies this equation. We shall show that (15) 
contains the set of all solutions. Let x = y(t) be any solution of (14). 
By Theorem 3 [see the end of (A)], it may be assumed that the solution 
x = ¢(t) is defined for all values of ¢. Let us set o(0) = x9, o(0) = Zp. 
We see easily that r and a can be selected in such a way that r cos a = 2, 
—rwsina = £o hold. 

If these equalities are satisfied, then the solutions (15) and ¢(é) have 
identical initial values 0, x9, @o, and therefore coincide [see (A)]. 

The function (15) describes the harmonic oscillatory process for the 
harmonic oscillator. The positive constant r is called the amplitude of the 
oscillation (15), and a is its :nztial phase or simply its phase. Equation (14) 
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Figure 10 


is called the equation of the harmonic oscillator. The number w is called 
the frequency of oscillation, although in reality the number of oscillations 
per second is determined by the formula 


w 


y= On 


2. We shall investigate the motion of a point p with mass m along 
a horizontal straight line / under the action of a force F which is attracting 
it toward the point O on the same straight line and is proportional to the 
distance between the points p and O. To form the equation of motion of 
the point p we shall take a coordinate system with the point O as origin 
on the line J. The variable coordinate of the point p we shall designate by 
x = x(t). Then, by Newton’s second law, the equation of motion of the 
point p will have the form 


mE = F = —kx. 
This equation is usually written in the form 
mi+ kx = 0. (16) 


Physically, the force F can be realized by a spring of some kind (Fig. 10). 
The number k is called the coefficient of elasticity of the spring. According 
to formula (15), the solution of (16) has the form 


r= reos(/Ert+ a), r> 0. 


Thus, the frequency of oscillation w = ./k/m of the point p is determined 
by its mass m and by the elasticity & of the spring; it does not depend on 
the initial conditions. The amplitude of the oscillation r and its initial 
phase a depend on the initial conditions, i.e., on the location xo of the 
point p and on its velocity <9 at the instant ¢ = 0. 

3. We shall formulate and give an approximate solution of the equation 
of the mathematical pendulum. The mathematical pendulum is represented 
by a point p of mass m which under the influence of gravity moves on the 
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Figure 11 


circumference K of radius / in the vertical plane. The quantity 1 is called 
the length of the pendulum. On the circumference K we shall introduce 
an angular coordinate by taking the lowest point O of the circumference 
as the origin of coordinates (Fig. 11). The variable coordinate of the point 
p we shall denote by » = ¢(f). The point p is subject to the gravitational 
force P = mg, directed vertically downward. The component of this 
force along the normal to the circumference is balanced by the normal 
component of the inertial force (i.e., centrifugal force with opposite sign) 
and by the reaction of the connection (of the circumference or of the thread 
which forces the point to move along the circumference); the component 
along the tangent to the circumference at point p in the direction of 
increase of the angle ¢ is equal to —mg sin ». Thus the equation of motion 
of the point p has the form ml¢ = —mg sin 9, or 


le + gsing = 0. (17) 


This equation is nonlinear and its solution presents great difficulties. 

If it is assumed that the coordinate ¢ of p is very close to zero during the 
motion, then in equation (17), ¢ may be substituted approximately for 
sin g and we obtain an “approximate” linear equation of the pendulum: 


le + ge = 0. 


Its solution has the form [see (15)} 


e= reos(/f¢+a)- 
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Thus the frequency of “small oscillations” of the pendulum is determined 
by the formula w» = V/g/l. 

The frequency of oscillation of the solutions of equation (17) depends 
on the amplitude of the oscillations and decreases when the amplitude of 
the oscillations increases. 

The number »y of small oscillations of a pendulum per second is deter- 
mined by the formula 

w 1 fg 


v= 5 7 


Qn 2 Vi 


For example, the length of a seconds pendulum, i.e., of a pendulum which 
performs one oscillation per second (vy = 1), is determined by the formula 


= 7% ~ 0.25 m. 

5. Complex differential equations. Up to this point we have considered 
only real equations and their real solutions. In certain cases, however, as 
in the case of the solution of linear equations with constant coefficients, 
it is easier to first find complex solutions of the real equation and then to 
select from them the real solutions. To present this approach we must 
introduce the concepts of a complex function of a real variable and of a 
complex system of differential equations. 

‘ (A) A complex function X(t) of a real variable ¢ is said to be defined if, 
to each é on a certain interval r; < ¢ < rg, there corresponds a complex 
number 


X(t) = eff) + wt), 


where g(t) and ¥(é) are real functions of the real variable t. The function 
g(t) is called the real part of the complex function X(é) and function y(Z) is 
called its imaginary part. A complex function X(é) is said to be continuous if 
the functions ¢(t) and ¥(£) are continuous. In exactly the same way, the 
complex function X(t) is called differentiable if the functions g(t) and y(t) 
are differentiable; the derivative X(t) of a complex function x(t) is defined 
by the formula 
X(t) = off) + Wd). 


By direct inspection we see that the usual rules for differentiation of the 
sum, product, and quotient of two complex functions of a real variable are 
valid. 


(B) Let ; 
# = hi(t,2',...,2"), grol,...,n (1) 


be a normal system of differential equations. We assume that the functions 
hi(t, z',..., 2") on the right-hand sides of the equations are defined for 
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complex values of the variables z',..., 2”. We can confine ourselves, for 
example, to the case when these functions are polynomials in the variables 
z!,..., 2" with coefficients which are real or complex continuous functions 
of a real variable ¢ defined and continuous on the interval gq; < t < qo. 
Under these conditions, it is quite legitimate to pose the problem of finding 
complex solutions of system (1). The system 


Z2=xX), jol....n (2) 


of complex functions of a real variable ¢ which are defined on some interval 
ry < t < rq will be called a solution of (1) if substitution of the functions 
(2) for the variables 2’ leads to a system of identities in ¢ on this interval. 
Since we are assuming that the right-hand sides of (1) are polynomials in 
z!,..., 2", they are defined for all values of these variables. Hence, the 
following existence and uniqueness theorem is valid for the system (1). 


Let 


to, 26, 20, sey 26 
be an arbitrary system of initial values. Here z,..., 29 are arbitrary 
complex numbers, and ¢9 is an arbitrary real number satisfying the condi- 
tion qi < to < qe. Then there exists a solution 


2? = x(t), j=il,...,%, 
of (1) which satisfies the initial conditions 


x(t) = 2, gol...,n 


Any two solutions with identical initial conditions coincide on the common 
part of their intervals of definition. 

If (1) is linear, i-e., if the polynomials h? are of first degree, then for any 
initial values there exists a solution of (1) defined on the entire interval 
Ma <t< qe. 

This existence and uniqueness theorem for a normal system of complex 
equations follows directly from Theorem 2 after every complex unknown 
function z’ is split into its real and imaginary parts. Indeed, let us assume 


gorwtiy), jgHl,...,n, (3) 


and make a change of the variables z’,j = 1,..., 7, in system (1) accord- 
ing to formulas (3); then we shall have 


ie + iy = ft, x}, se ve", y", srry y") 
+ igi(t, 2)... 2%, yy... 5¥"); (4) 
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where f? and g’ are real functions of real arguments which satisfy the 
relations 


fii,zi,...,¢y,y',-..,y¥") + ig(t,xt,...,a%yl,...,y”) 
= A}(t, a) + ty)... , 2" + ay”). 


Trom (4) it follows that 


P= fi,x',...,a,y',...,y), jul,...,n, - 
(5) 


ya —Ptjz',...,a%,y',-..,y"), j=l,...,n 


Thus, in place of the normal system (1) of complex equations, we sub- 
stitute a normal system (5) of real variables. Since the right-hand sides 
of equations (1) are polynomials in z', ..., 2”, the right-hand sides of (5) 
are polynomials in z!,..., x”, y!,..., y. Since the coefficients of the 
polynomials h? are continuous functions of ¢ on the interval q, < £ < qo, 
the coefficients of the polynomials f? and g’ are also continuous functions on 
the same interval. Thus, the right-hand sides of (5) are defined and satisfy 
the hypotheses of Theorem 2 in the domain I, which is defined by the 
single condition g; < ¢ < gz imposed on ¢, while the remaining variables 


zt,...,2" and y!,..., y” remain arbitrary. Setting 


zg = rh + iy}, j=il,...,n, 
MH) =e'OY+W, G=l...,n, 
we arrive at the problem of determining the solution of (5) under the 
initial conditions 
g(t.) = 2, jF=l,...,n, 
W(t) = ys, G=l,...,n 


By Theorem 2, this solution exists, and any two solutions with identical 
initial conditions coincide on the common part of their intervals of defini- 
tion. 

If (1) is linear, then (5) is also linear, and for this reason the final part of 
proposition (B) follows from Theorem 3. 

It should be noted that system (1), whose right-hand sides consist of 
polynomials in z!, ..., 2", can be real, i.c., the coefficients of these poly- 
nomials can be real functions of t. Nevertheless, we can also treat (1) as a 
complex system; namely, we can find its complex solutions, assuming that 
the functions z!, ..., 2" are complex. This approach to real equations is 
used because in certain cases it Is easier to find the complex solutions of 
real equations than the real solutions. In this case the complex solutions of 
a real system of equations are found first, and then the real solutions are 
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selected from the complex solutions; i.e., only those solutions are con- 
sidered whose imaginary part vanishes. Linear equations with constant 
coefficients will be solved later in exactly this way. 

Just as in the real case, we can reduce quite general systems of differ- 
ential equations to normal systems in the complex case. Thus in the com- 
plex case we have propositions analogous to propositions (A) and (B) of §4. 
Here we shall formulate only the existence theorems for an nth-order 
differential equation. 

(C) Let 

zm = ft, z, z, sey gh) (6) 


be an nth-order equation whose right-hand side is a polynomial in the 
variables z, 2,..., 2°"? and whose coefficients are continuous real or 
complex functions of é defined on the interval gq; < t < qo. If to, zo, 
zo,..., 26" are now arbitrary initial values, where zo, 20, ..., 26"? are 
arbitrary complex numbers, and if f is a real number satisfying q,; < 
to< qe, then there exists a solution z = ¢(£) of equation (6) which satisfies 


the initial conditions 
e(to) = Zo; p(to) = Zo, rs) oY (to) = zy. 


Any two solutions with identical initial conditions coincide on the common 
part of their intervals of definition. If (6) is linear, ie., if the polynomial 
f is of the first degree, then for arbitrary initial values there exists a solu- 
tion defined on the entire interval gq; < t < qo. 

In §7 and below, the complex function e™ of the real variable ¢, where ) is 
a complex number, will play an important role. We shall define this func- 
tion here and prove some of its properties. 

(D) Let w = u-+ w be an arbitrary complex number; let us assume that 


w 


e” = e”“ (cosv + 7sin v). (7) 


It is easy to see that the relation e” = e” holds. Below we shall prove 
the formula 
eviev2 — evitee (8) 


The well-known Euler formulas follow directly from (7): 


COS U ete” sin v a 
= —_———_ » = —— > 
2 21 


Let \ = uw + iv be a complex number. By (7) we have 


eM — eM (cos vt + isin vt). 
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We shall show that for complex values of \ the following differentiation 
formula is valid a 
@ rt ya at 
Hie hev'" (9) 


which is well known for real values of the parameter X. 
Formula (7), taken here as the definition of the function e” of the com- 
plex variable w, can be proved if the function e” is defined by the series 


=1p0¢ he + pO pee 


We shall, however, assume that the function e” is defined by (7). 
We shall prove formula (8). Setting 


WwW, = UW + 14, We = U2 + We 
yields 
evie™2 = e”1 (cos vy + 7 8in v;)e”2 (cos vg + 7 Sin V9) 


= e“it2 (cos (vy + vg) + isin (vy + vg)) = ets, 


We shall now prove formula (9). We shall consider first the case of a pure 
imaginary number A = zy. We have 


d ev? 


d os . . 
7] = 7 (00S vi-+7tsin vt) = —yvsin vt + tv cos vt 


= iv (cos vt + isin vt) = ive’ 


Further, for an arbitrary \ = uw + zy, by the formula for the differentia- 
tion of a product, we have 


d — * rt. eivt 


_ pee tpt + ivette tye = (u + weet _ re, 


ef) = — © (etter + ar 


EXAMPLES 
1. We shall investigate the complex equation 
z=, (10) 


where z = x + ty is a complex unknown function of a real variable ¢, and 
» = w+ wis a complex number. From (9) it follows that 


z= ce (11) 


is a solution of equation (10) for an arbitrary complex constant c. We 
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shall show that formula (11) contains the set of all solutions. For this, as 
in Example 1 of §1, it would be possible to use the uniqueness theorem, 
but here we shall use Theorem 3 to show how it can be used to simplify 
somewhat the calculations. In this case these simplifications are not 
particularly significant, but later an analogous method can give more 
substantial results. Thus, let z2 = x(t) be an arbitrary solution of (10). 
By virtue of Theorem 3 [see the last part of proposition (C)], we may 
assume that this solution is defined for all values of ¢. Setting x(0) = 2o, 
we see that the solution z = X(t) has as its initial values the numbers 0 
and 2p. It is clear that the solution 


2.=> zoe™, 


which is obtained from (11) by setting c = 2p, has the same initial values. 
If it is assumed that ¢ = re’, where r > O and a@ are real numbers, then 
(11) may be written in the form 


z= re tia (12) 
We now split equation (10) into its real and imaginary parts. We have 


E+ ty = (u + w(x + ty) = (uv — vy) + ivr + my) 


or 
= pr — vy, Y= vet py. (13) 


Thus the system (13) of two real equations is equivalent to one complex 
equation (10), and therefore an arbitrary solution x = ¢(t), y = Y(t) of 
(13) is related to the arbitrary solution (12) of (10) by the relation 


g(t) + w(t) = ret = r(e" cos (vt + a) + 7sin (vt + a)), 
whence we obtain 
x = g(t) = re” cos (vt + a), y = v(t) = re“ sin t+ a). (14) 


Thus, by using complex functions and equations, we have found the 
solution (14) of the system (13) of real equations. 
2. We shall present one more example of splitting a complex equation 
into two real ones. Let . 2 
2= 2°+ 122 
be a complex equation, where z = x + 7y is a complex unknown function 
of the real variable 7. We have 


t+ ty = (@ + ty)? + ie + ty) = (x? — y? — y) + try + 2) 


and therefore ; 3 3 
Z= 2° —y"—y, y= 2ry +2. 
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6. Some properties of linear differential equations. A system of dif- 
ferential equations is called linear if all unknown functions and their 
derivatives, taken together, enter the equations of the system linearly. 
Thus the system of linear equations of the most general type can be 
written in the form 


DY ain(2)" +.) =0, t=1,...,0. (1) 
ik 
Here z!,..., 2” are unknown functions of the independent variable t, and 


the coefficients a;;,(t) and the free terms 0,(t) of the equations are functions 
of ¢. If all free terms of (1) are identically zero, then the system is called 
homogeneous. To each linear system corresponds a homogeneous linear 
system obtained from it by discarding the free terms. Thus to the linear 
system (1) corresponds the homogeneous linear system 


SY andy)” =0, i=1,...,2. (2) 
g,k 


We shall note several immediate properties of linear systems. In formu- 
lating these, it will be assumed that all coefficients and free terms of the 
linear system are defined and continuous on the interval gq, < t < qo; 
all solutions under consideration will be assumed to be defined on the 
entire interval gq, < ¢ < qo. 

(A) Ify’ = ¢'(t) andy’ = Y(),7 = 1,..., nare two solutions of the 
linear homogeneous system (2), and if cy and cs are two arbitrary numbers, 
then the system of functions 


y" = c19'(t) + cov'(t), 1=1,...,%, 


is also a solution of the homogeneous system (2). An analogous assertion 
is also valid for three or more solutions of (2). 

(B) If «* = y(t) and z' = x‘(),7 = 1, ..., n, are two solutions of 
(1), then the system of functions 


y=xXO-VO, t=1...52 


is a solution of the system of homogeneous equations (2). Further, if 
y' = ¢'(t), i = 1, ..., 7, is a solution of the homogeneous system (2), 
and x’ = y(t), i = 1, ..., n, is a solution of (1), then the system of 
functions . . . : 
z= ¢'(t) + v0), a= 1]1,...,%, 

is also a solution of (1). 

(C) We shall assume that the free terms of (1) may be written in the 
form of sums 


bi(t) = ac,(t) + Bd(t), it=1,...,7n. 
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We shall consider along with system (1) two systems of equations: 


DY ain Oe) + c(t) = (3) 
j,k 


| 
S 
S. 

ll 
hon 
3 


I 
S 
2. 

| 
om 
3 


Dd aize(t) (2?) + dil) (4) 
jk 


If ct = y(t), i = 1, ...,n, is a solution of (3) and 2’ = x‘(), 7 = 1, 
...,%, is a solution of (4), then the system of functions 


x’ = a(t) + Bx), t= 1,...,2, 


is a solution of (1). 


CHAPTER 2 
LINEAR EQUATIONS WITH CONSTANT COEFFICIENTS 


Systems of ordinary differential equations with constant coefficients 
constitute a large and important class of ordinary differential equations 
which may be solved completely with the aid of elementary functions. 
In view of the fact that the solution of these equations does not, in prin- 
ciple, present any great difficulties, they are often considered to be of no 
great interest for theory, and in textbooks they are usually relegated to the 
position of simple exercises appended to the general theory of linear equa- 
tions. Linear equations with constant coefficients, nevertheless, have 
numerous engineering applications, since the performance of many tech- 
nical devices is described in an adequate manner by these equations. It is 
precisely the engineering applications which bring forward a series of new 
problems of a theoretical nature in the theory of linear equations with 
constant coefficients. To the solution of these theoretical problems many 
studies of an applied nature have been devoted, and some of these are 
reflected in the present chapter. Thus in this chapter we employ the 
operational notation which is customary in engineering practice and which 
is very convenient for the solution of systems of equations by the method 
of elimination. The problem of the stability of solutions of systems of 
linear equations, which is very important in the theory of automatic 
control, is studied. Further, we shall develop the so-called method of 
complex amplitude, which is a convenient means for determining particular 
steady-state solutions and is widely applied in electrical engineering. 

Rather than confining ourselves to the solution of the purely mathemati- 
cal problems arising from applications, we shall present here in very short 
dogmatic form an exposition of the theory of electrical circuits. The design 
of electrical circuits gives a good and important, from the engineering 
point of view, illustration of the mathematical methods developed in this 
chapter. 

In addition, the present chapter includes a study of the phase plane of 
second-order linear systems, which is preceded by a very elementary 
study of phase spaces of (generally speaking, nonlinear) autonomous 
systems. Phase spaces of autonomous systems also find important appli- 
cations in engineering. 

Because of what has been said above, the chapter on linear equations 
with constant coefficients occupies a considerably larger place in this book 
than is customary in textbooks on the theory of ordinary differential 
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equations, The presentation of all the material of the present chapter is 
very elementary, with the sole exception of §14, where matrices in Jordan 
form are used. Everything done with the aid of matrices in Jordan form 
may be skipped, as has been indicated in detail in §14, since it is not used 
later in the book. 


7. The linear homogeneous equation with constant coefficients. Case 
of simple roots. In this section and in the following we shall solve the 
linear homogeneous nth-order equation with constant coefficients, L.e., 
the equation 


2 aye") tees +t an_12 + anz = 0, (1) 


where z is an unknown function of the independent variable f, and the 
coefficients a1, ..., @, are constants (real or complex). Tirst, we shall 
find all complex solutions of this equation, and then (in the case where 
coefficients a1, ..., @n are real) separate from them the real solutions. 
Equation (1) can be written in the form 


zZ™ = —ayz™—) — «++ — an _12 — One, (2) 
so that the existence and uniqueness theorem can be applied to it [see 
proposition (C), §5]. Later on, we shall use only the uniqueness, since 
solutions of equation (2) will be found explicitly, and their existence will 
be established by this fact; uniqueness itself will be used to prove that all 
the solutions have been found. 

In engineering applications of ordinary differential equations with con- 
stant coefficients, an important role is played by the operational calculus. 
We shall use here the symbolic (or operational) notation which is the basis 
of the operational calculus. The essence of this notation consists of the 
fact that the derivative of an arbitrary function z = 2(é) with respect to 
the time ¢ is designated not by (d/dt)z, but by pz, so that the letter p placed 
to the left of the function is a differentiation symbol with respect tot. If 
we apply to the differentiation symbol p certain algebraic operations, we 
are led to the notation 

dF 
dik e.= pez. 
By using this notation, we can write 


agz™ + ayz™—P foes + aad + ane 


= agp’z + ayp"—'2 + +++ + an_ipe + ape. 


If in the right-hand side of the last equation we take the function z out of 
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the parentheses, we obtain the equality 


agz” + ayz™—P 4 0+ + ani + Onz 
= (agp” + ayp*~! + +++ + Gasp + an)z. 
Thus we come to a formal definition. 
(A) Let 
L(p) = aop” + ayp"—} + ose + Gn—1p + an 
be an arbitrary polynomial with constant coefficients (real or complex) 


with respect to the symbol p, and let z be a certain real or complex function 
of a real variable ¢. We set 


L(p)e = agz™ + aye) 4 e+ ag yd + ane. (3) 


If L(p) and M(p) are two arbitrary polynomials in the symbol p (or, as 
we say, in the differentiation operator p) and 2, 21, 22 are functions of ¢, then 
we have the identities 


L(p)(z, + 22) = L(p)zi + L(p)2e, 
(L(p) + M(p))z = L(p)z + M(p)z, 
L(p)(M(p)z) = (L(p)M(p))z. 


By the notation introduced, equation (1) can be written in the form 


L(p)z = 0, (4) 
where 
L(p) = p® + ayp™"! + +++ + an_-ip + an. 


(B) Let L(p) be an arbitrary polynomial with respect to the symbol p. 
Then 
L(p)e* = L)e™. (5) 


We shall prove formula (5). We have 
pe = rer’ 


[see formula (9), §5]. From this it follows that p*e’’ = v*e’. Hence 
formula (5) follows immediately [see (3)]. 

It follows directly from formula (5) that the function e™ is a solution 
of equation (4) if and only if the number ) is a root of the polynomial 
L(p). The polynomial L(p) is called the characteristic polynomial of equa- 
tion (4). In the case when it has no multiple roots, the set of all solutions 
of (4) is described by the following theorem. 
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TuHrEoreM 4. If the characteristic polynomial L(p) of the equation 
L(p)z = 0 (6) 


[see (1) and (4)] has no multiple roots, if its roots are 


Ay, A2,-- +) An, 
and if we set 
zy = et Zo = ed! 1) en = ent (7) 
then for any complex constants c!, c?,..., c”, the function 
z= clay + c7zg + +++ 4+ C7 Zn (8) 


is the solution of equation (6). This solution is the general solution in 
the sense that every solution of equation (6) can be obtained from (8) 


by proper choice of the constants c', c?,..., c”. Here the constants 


c',c?,...,c” (which are called integration constants) are defined uniquely 


for every given solution z. 


Thus the functions (7) constitute the so-called fundamental system of 
solutions of (6) [see §18, (C)]. 

We note that the functions (7) are defined over the entire axis —x < 
ti < +00. 


Proof. Jt follows from (5) that every function of the system (7) is a 
solution of (6), and, because (6) is linear and homogeneous, it follows 
[see §6, (A)] that, for any complex constants c', c?, ..., c”, (8) gives the 
solution of (6). We shall show that if zx = z(t) isan arbitrary solution of 
(6), then it can be written in the form (8). By proposition (C), §5, we can 
consider the solution z+ to be defined on the entire axis —o < Ut < oo. 
Let us set 


z,.(0) = %0) 2,(0) = Zo, sey zy? (0) = zg). 


We shall now show that constants c!, c?, ..., ec” can be chosen in such a 
way that the solution z(t), defined by (8), satisfies the same initial conditions 


z(0) = Zo, 20) = 2, ..., 277-20) = 2) (9) 
Substituting the function z from (8) into the equations (9), we obtain 
c'2(0) +e + c%20) = 2; os = O,1,...,n — 1. (10) 


The relations (10) represent a system of equations in the unknowns 
c}, c?,..., ce”. In order that (10) may be solved, it is sufficient that the 
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determinant of the matrix 


z1(0) 22(0) Z,(0) 
2,(0) 22(0) 2n(0) 
2-0) o-2@)... 2-20) 
Z—Po) ef PO)... 27 PO) 


does not vanish. 
It is immediately evident that the matrix (11) has the form 


1 1 1 
AL re) An 
rn rg 2 
Dalen Ven Ve 


and therefore its determinant (Vandermonde’s determinant) is different 
from zero because all the numbers dj, do, ..-, An are mutually distinct. 
However, we shall give another (direct) proof that the determinant of (11) 
is different from zero. Later this proof will also be extended to the case 
of multiple roots. 

If the determinant of matrix (11) were zero, then we could find a linear 
dependence between its rows. Let us assume that this linear dependence 
does exist. This means that there exist numbers 6,1, bn_2,..., bo, not 
all zero, such that, if we multiply the rows of the matrix (11) by these 
numbers and add, we obtain a row of zeros. If we calculate the nth term 
of this zero row, we obtain 


bn—124(0) + Bn —a2e(O) + ++» + bre? (0) + bo2i'(0) = 0. (12) 
If we denote by M(p) the polynomial 
bop?" + dip"? + +++ + ba-op + baa, 
we may write (12) in the form 
M(p)ézi|1—0 = 0. 
By virtue of formulas (5) and (7), we thus obtain 


M(x) = 0, 
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which is impossible, since the degree of M(p) does not exceed n — 1, so 
that it cannot have n distinct roots \1,..., Az, ..., An. This contradiction 
indicates that the determinant of (10) cannot be zero; consequently, the 
constants c}, c?, ..., ce” can be determined (and uniquely, moreover) so 
that the solutions zx (¢) and 2(£) satisfy identical initial conditions. For this 
choice (and only for this choice) of constants the solution (8) coincides 
with the given solution z*(¢). Thus Theorem 4 is proved. 

If the coefficients of the polynomial L(p) in equation (6) are real, then 
we have the problem of separating the real solutions from the set (8) of all 
complex solutions. The solution of this problem rests upon proposition 
(D), in the formulation and proof of which we shall use vector notation. 
We shall recall that notation here. 

(C) We shall call the sequence consisting of 7 numbers, 


u= (wy, u’,..., 4”), 


a vector of an n-dimensional space. Here u is the vector and the numbers 
ul, u?,..., u™ are called its coordinates (or components). We shall always 
designate vectors by boldface letters. If the coordinates of the vector are 
real numbers, then the vector is called real; if its coordinates are complex, 
then the vector itself is considered complex. The vector u, which is the 
complex conjugate of the vector u, is defined by the equality 


It is clear that vector u is real if and only if 
u=u. 


The product of the vector u = (u!, u?,..., u”) with a real or complex 
number a is defined by the formula 


au = ua = (au!, au?,..., au”). 
The sum of the vectors 
u= (u},u7,...,u") and v= (v},0?,...,07) 
is defined by the formula 
utv= (wl tol wu? +07 ..2,u™ +0"). 
The vector 0, all of whose coordinates are zero, is called the null vector. 


Let 
Wy, Ue,..., U;, 


7] LINEAR EQUATION, CASE OF SIMPLE ROOTS 47 


be a finite system of vectors. The relation 
atu, + a7ug +--- + au, = 0, 


where not all of a!, a7, ..., a” are zero, is called a linear dependence 
among the vectors u;, Us,..., U,. If the vectors are not linearly depend- 
ent, then they are called linearly independent. Let 


uj = (uj,uj,...,u3),  g=l...yr 

The numbers uj form a matrix (uj);i = 1,...,n;7 = 1,...,7. If we 
assume that the upper index 7 designates the number of the row and the 
lower index j the number of the column, then the matrix (uj) has height n 
and width r. Thus we have a correspondence between vector u; in the 
matrix (uj) and the jth column, which consists of the coordinates of this 
vector. Hence, it is clear that to a linear dependence among the vectors 
Uj, Ug, ..., U, corresponds a linear dependence between the columns of 
the matrix (uj). Whenever r = n, the matrix (uj) is a square matrix, 
and the vectors uj, Ug, ..., U, are linearly independent if and only if the 
determinant [uj of this matrix is not zero. 


(D) Let 
Zi, Z2,-..,2n (18) 


be a system of n linearly independent complex vectors in an n-dimensional 
space. Let us assume that (13) also contains the conjugate of every vector 
in (13). Under these assumptions, the vector z, defined by the formula 


z= c'Z, +--+ + c'Zn, (14) 


is real if and only if the coefficients of all pairs of conjugate vectors are 
conjugate and the coefficients of all real vectors are real. 
Let us prove this. To be definite, we shall assume that the relations 


Zi = Zg,..-, Zok—1 = Zok, 
Zj = Z;; j= 2kh+1,...,7 
are satisfied. Then according to formula (14) the vector z has the form 


1 2k-1 2k 
z= c'z, + 0722 +--+ + 0 zon + 0 Za, 
2k+1 
+ czy pe beth, (15) 


while the vector z has the form 


> =2 = =2 =-2k— 
Z = 072, + Blzg +--+ + Orzo, + 2-125, 
=-2 1 = 
See 218 Oe ee (16) 
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aia c2et1 _— eter a c= &”, (17) 
then it follows from (15) and (16) that z = z, i.e., the vector z is real. If, 
conversely, we assume that z is real, i.e., that z = z, then (15) and (16) 
give [by the linear independence of the vectors (13)] the system of rela- 
tions (17). Thus, proposition (D) is proved. 

Proposition (E) below presents a method of separating real solutions 
from the set of all complex solutions of equation (6) in the case that the 
coefficients of the polynomial L(p) are real. 

(E) Let us assume that the coefficients of L(p) are real; then to every 
complex root \ of L(p) corresponds a conjugate root X. The solutions e* 
and e* of equation (6) are conjugate to each other [see §5, (D)]. If the 
root » is real, then the solution e™ is real. Thus, corresponding to every 
solution of the fundamental system (7), there is also a complex conjugate 
solution. For the solution (8) of (6) to be real, it is necessary and sufficient 
that the coefficients of pairs of complex conjugate solutions be conjugate 
and the coefficients of real solutions be real. 

For proof let us denote by z; the vector with coordinates 


{z;,(0), 2,(0), one , 2” (0), zr (0)} 


and by z the vector with coordinates {zo, 2o,..., 28°}. Then (10) 


takes the form 
e'z, + e7zg +--+ + eZ, = 2. 


The vectors Z, Zo, ..., Zn are linearly independent, since the determinant 
of (11) is not zero. Thus, the necessity of the condition presented in (E) 
follows directly from (D). On the other hand, if this condition is fulfilled, 
then the solution (8) is real. Indeed, if 4; and A» are two complex con- 
jugate roots, and c’ and c? are two complex conjugate constants, then 


cle! and ce! are complex conjugate functions and, consequently, their 


sum is real. Thus proposition (E) is proved. 
EXAMPLES 
1. We shall find all complex solutions of the equation 
z® — 324 9+ 132 = 0, 
which can be written in the form (6), where 
L(p) = p? — 3p? + 9p + 13. 


By direct inspection, we see that p = —1 is a root of the characteristic 
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polynomial L(p). Factoring L(p) by p + 1, we obtain 


L(p) = (p + 1)(p? — 4p + 13), 
from which we obtain two more roots, 2 + 32. Thus the roots of L(p) are 
the numbers 
Ay = 2+ 31, Ag = 2 — 3, 43 = —Il. 


By Theorem 4 the general complex solution of the equation under con- 
sideration has the form 


= cle 2t3ve + c2e(2—8t + cee, 


In Examples 2 and 3 below we give two general rules for determining 
the real solutions. These rules follow directly from proposition (E). 

2. We shall assume that the fundamental system of solutions (7) satisfies 
the conditions 


2] = 22, ..., 2ak—-1 = Zak, 2ok41 = Zok41; «+--+ ln = 2n, (18) 
and set 
4 = 2, + ty, ..-, 2er-1 = Te + Yk, 

where 21, ..., Zk, Y1,---, Ys are real functions. We shall further assume 
that the numbers c!, c?, ... , e” satisfy (17), and set 

1_ lf, hl 2k—-1 _ ag_k “pK 

c = 3(a — 1b’), ..., € = $(a” — 76"), 
where a}, ..., a*, b!, ..., b* are real numbers. With this notation the 


general real solution of (6) may be written in the form 
z= ala, + bly, t-++ taken + bey, + CT ong $e + cn, 


where 


al bi, ..., a, be c2RtI gt 
are arbitrary real numbers. Thus, if for every pair of conjugate complex 
solutions we substitute their real and imaginary parts in the fundamental 
system (7), we shall obtain a fundamental system of real solutions. 


3. Again we shall assume that the solutions (7) satisfy (18); let us set 
AL => 1 + Wy, wy Nok—1 = ME + Wr. 
Under the hypothesis that c’, c?, ..., c” satisfy (17), we may set 


1 i 2k—1 , 
c' = dpe’, ..., € = 4p’. 
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In this notation every real solution z may be written in the form: 


z= p, e“!' cos (Vit + ay) + +++ + pg et’ cos (vat + ax) 
fe cPktl dantit 4... 4 ot erat 


Here p1,..., Px, 1,..., aR, c2**+! ... ce are arbitrary real constants. 


From the last expression it is evident that every imaginary part v; ~ 0 
of the root d; gives the solution an oscillatory character with a frequency 
v;, while every real part u; of the root \; causes the solution either to grow 
(if u; > 0) or become smaller (if u; < 0). 

4. By using the results of Examples 2 and 3, we can write all the real 
solutions of the equation investigated in Example 1 in the following 
two forms: 

z = ale* cos 3¢t + ble sin 3¢ + c¥e—!, 


z= pie” cos (Bt + a1) + cle. 


8. The linear homogeneous equation with constant coefficients. Case 
of multiple roots. If the characteristic polynomial 


L(p) = p” +ayp” | + +++ + an_ip + an 


of the equation 
L(p)z = 0 (1) 


[see §7, (A)] has multiple roots, then by functions of the form e™ it is not 
possible to find n distinct solutions of equation (1). To find solutions of 
another form in this case, we can use the following heuristic reasoning. 
Let 4; and dz be two distinct real roots of the characteristic polynomial 
L(p); then the function (e%! — e')/(A, — dg) isa solution of (1). If we 
now assume that with a change in the coefficients of L(p) the number A» 
tends to 3, then this solution tends (in the limit) to the function te’, 
which may naturally be assumed to be a solution of (1) whenever \, is a 
double root of the polynomial E(p). Similarly, we arrive at the conjecture 
that, if \ is a k-tuple root of the characteristic polynomial L(p), then all 


the functions 
et te pe—V phe 


are solutions of (1). Extending this conjecture to the case of complex 
multiple roots, we come to the question of the validity of the following 
theorem (which is a generalization of Theorem 4): 


THEOREM 5. Let 
L(p)z = 0 (2) 


be a linear nth-order homogeneous equation with constant coefficients. 
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Further, let 1, ..., Am be the set of all mutually distinct roots of the 
characteristic polynomial L(p) of equation (2), the root \; having the 
multiplicity k;, so that kj + k2 + -+---+k, = n. If we set 


zy = et zg = te’, ..., a = tiled. 
—_ prgt — Agt — zpko—l rot. 

Ze+1 = e?, Zk, 42 = le, caey Zk +k, = 0? ee; (3) 
y Zn = thm—1Qrm! 


then all the functions (8) are solutions of (2), so that for any complex 
constants c!, c?,..., ¢”, the function 


z= cla; f+++ +c", (4) 


is also a solution of (2). This solution is the general solution in the 
sense that every solution of (2) can be obtained from (4) by a proper 
choice of the constants c', ..., c”, where the constants c!,..., ce” are 
defined uniquely for every given solution z. 


Thus the functions (3) constitute the so-called fundamental system of 
solutions [see §18, (C)] of equation (2). We note that the functions (3) 
are defined on the entire axis —o < t < +00. 

We shall preface the proof of Theorem 5 with a proof of the so-called 
shift formula. 

(A) Let L(p) be an arbitrary polynomial, \ any complex number, and 
f(t) a function which can be differentiated an arbitrary and sufficient 
number of times. Then the following important formula is valid: 


L(p) (ef) = e+ Lip + df (0. (5) 


We shall prove formula (5), but first we shall verify it for the case 
L(p) = p. We have 


pleNf()) = ref) + MAY = eM(p + HO. 


Now it is easy to verify formula (5) for any first-degree polynomial L(p) = 
ap +b. We have 


(ap + b) (eft) = ap(e“f(t)) + bef(t) 
ae™'(p + d)f(t) + be“ f(t) = e[a(p +) + d1F(). 


In the general case, we shall carry out the proof of formula (5) inductively 
with respect to the degree n of polynomial L(p). As we have seen, the 
formula is valid for » = 1. Let us assume that it is also valid for a poly- 
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nomial of degree n — 1, > 2, and prove it for a polynomial L(p) of 
degree n. To do this we shall factor the nth-degree polynomial L(p) into 
two factors, L(p) = L,(p) - Le(p), where L,(p) is of the first degree and 
L2(p) is of degree n — 1. Since formula (5) is correct for each of the 
polynomials L,(p) and Le(p), we have [see §7, (A)] 


L(p)('F() = Li(p)[Le(p) (e“F)] = Lip) (e“ La + HFM) 
ei (p + d)Lo(p + MFO = eML(p + AF. 


Thus formula (5) is proved. 

We shall now prove proposition (B), which includes Theorem 5 almost 
entirely. 

(B) Let L(p) be an arbitrary polynomial in the symbol p, and let the 
function w,(é) of the real variable ¢ be defined by the formula 


w(t) = L(p)t"e™, 


where \ is a complex number. We find that, if \ is a k-tuple root of L(p), 


then the functions wo(t),..., w,—1(t) are identically zero. On the other 
hand, we find that, if the functions wo(é),..., w,—1(t) equal zero for even 
one value ¢ = fo, ie., if the equalities 

wo(to) = wy(to) = +++ = wr_ilto) = 0 (6) 


are valid, then \ is a root of L(p) of multiplicity not less than k. 
Let us prove proposition (B). By the shift formula [see (5)] we have 


w(t) = L(p + dt. (7) 
Let us assume first that is a k-tuple root of L(p), i-e., that 
L(p) = M(p)(p — d)*. 
If we replace p by p + A in this identity, we obtain 
L(p +d) = M(p + djp*. (8) 
T'rom formulas (7) and (8) we obtain 
w(t) = &M(p + d)(p*t?) = 0 for r=—0,1,...,k — 1, 


since p*t” = Oforr < k. Thus the first part of proposition (B) is proved. 
Let us now assume that (6) holds. If we expand L(p + A) into powers 
of p, we obtain 


L(p +d) = bo + bip + +++ + bn_ip™* + bap”. (9) 


8] LINEAR EQUATION, CASE OF MULTIPLE ROOTS 53 


From (7) and (9) we obtain 
wo(lo) = edo, 

which by virtue of (6) gives 

bo = 0. 
Let us now assume that the equalities 

bo = bi = --- = 3,1 = 0, r<k-1, (10) 
are valid and prove that b, = 0. By formulas (7), (9), and (10) we have 
w,(to) = eo rlb,. 

It follows from this and formula (6) that 

b, = 0. 


Thus bp = by = +++ = b,_1 = 0, and the polynomial L(p + A) has the 
form 


L(p +) = bep* + +++ + bap” = (be + +++ + bap**)p* = My(p)p*. 
Substituting p — for p in this identity we obtain 
Lip) = Mi(p — d)- (p — »)*, 


which shows that \ is a root of the polynomial L(p) of multiplicity no 
less than k. Thus proposition (B) is proved. 

Proof of Theorem 5. From the first part of proposition (B) it follows 
directly that the functions (3), defined in the formulation of Theorem 5, 
are solutions of (2). We shall prove that they constitute the fundamental 
system of solutions. To prove this it is sufficient to show that by a proper 
choice of the constants c!, ..., ¢” we can obtain from (4) an arbitrary 
solution 2« of equation (2). We shall prove this (here, unlike our proof of 
Theorem 4, we shall not use Theorem 3). 

Let zx be an arbitrary solution of equation (2) defined on the interval 
ry <t < re, and let tg be some point of this interval. Let us set 


z(t) = 20, 2,(to) = 20, ---, 2"-Yt9) _ 2—), 


We shall now seek constants c',... , c” such that a solution z of equation 
(2) defined by (4) will satisfy the same initial conditions as the given 
solution zx. Then by the uniqueness theorem we shall have z = zx (on 
the interval r; < ¢ < re). To determine the constants c}, ..., c”, we 
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have the system of equations 
cz}? (to) + c72$?(to) + - "2 (to) = 26”, s=0,1,...,n—1. 
(11) 


In order that system (11) be solvable, it is sufficient that the determinant 
of the matrix 


A1(l0) zo(to) Zn(to) 
XCD 25 (to) zn (to) (12) 
LPs) 2p —Y(to) 2. eto) 


be different from zero. We shall show that this determinant is not equal 
to zero. For this we shall show that the rows of (12) are linearly inde- 
pendent. Let us assume the contrary and let bp_1, bp_2, ..., bo be con- 
stants, not all zero, by which the first, second, ..., rows of the matrix 
are to be multiplied in order that their sum be zero. By writing the sum 
of the elements of the jth column, we obtain the equality 


02) (to) + bie (lo) + +++ + bn —2ei(to) + bn—12;(to) = 0, 


which can be written in the form 


M(p)z;\t=t) = 9, (18) 
where M(p) = bop"—! + byp"—? + +++ + ba_op + bp_1. The equality 
(13) for 7 = 1,..., k, shows that A, is a root of multiplicity at least k, of 


the polynomial M (p) [see proposition (B)]. In exactly the same way 
the equality obtained for 7 = kj + 1, ..., k; + ke shows that A» is a 
root of multiplicity at least ky of the polynomial M(p). The set of all 
equalities (13) leads to the conclusion that (taking multiplicities into 
account) the polynomial M(p) has not less than n roots, but this is im- 
possible since its degree is no higher than n — 1. Thus the assumption 
that the determinant of (12) is equal to zero has led us to a contradiction; 
this means that (11) is solvable (and, moreover, uniquely) in terms of the 
unknowns 


Thus Theorem 5 is fully proved. 
We shall note one obvious corollary of Theorem 5. 
(C) Every solution z(é) of equation (2) can be written in the form 


a(t) = fi(t)e’ + fo(thes’ + +++ + fn(ierm, 
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where f;(£) is a polynomial of degree not exceeding the number k; — 1, 
j=l, ,m. Here the polynomials f, (0), ..., fm(é) are defined uniquely 
by the solution z(t), since their coefficients are the integration constants 
c},c?,...,¢" which by Theorem 5 are defined uniquely by the solution z(t). 

If the coefficients of equation (2) are real, then the problem facing us is 
that of separating real solutions from the set of complex solutions of (2). 

(D) We shall assume that the coefficients of the characteristic poly- 
nomial L(p) of equation (2) are real. Let X be a certain root of L(p) of 
multiplicity k; then, forr = 0,1,..., & — 1, the function ée™ is a solu- 
tion of (2). If the root ) is real, then the function t’e™ is real; if \ is com- 
plex, then in addition to the solution te there is also the complex con- 
jugate solution ¢’e4, since X is also a root of multiplicity k of L(p). Thus 
whenever the fundamental system (3) admits a complex solution it admits 
the complex conjugate as a solution. In order that the solution (4) be real, 
it is necessary and sufficient that the coefficients of real solutions be real, 
and the coefficients of pairs of conjugate complex solutions be complex 
conjugate. 

The proof of proposition (D) is carried out in exactly the same way as 
the proof of proposition (E) of §7 on the basis of proposition (D) of §7. 


EXAMPLES 
1. We shall solve the equation 
23) + 324) + 32/7 + 2’ = 0. 


This equation can be written in the form (2), where the characteristic 
polynomial L(p) has the form 


p° + 3p* + 3p? + p? = p'(p + 1). 


The numbers 
At = 0, 
Ay = —1 


are roots of multiplicity k; = 2, k2 = 3, respectively, of this polynomial. 
Therefore by Theorem 5 the fundamental system of solutions of this 
equation has the form 


z4, = 1, Zo = t, @3=>e , zZ4 = le," z5 = te 
The general solution is given by the formula 


z= (ce! + ct) + (c? + ett + c8t?)e—. 
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2. Let us solve the equation 
2 + 22” +2= 0. 


The characteristic polynomial is L(p) = (p? + 1)?; the numbers A; = 7, 
hg = —17 are its two roots, and its general solution may be written in the 
form 


z= (cl + ete + (c3? + ett)e—*. 


The following two examples give the general rules for distinguishing 
real solutions stemming directly from proposition (D). Examples 3 and 4 
are completely analogous to Examples 2 and 3 of §7. 

3. In Example 2 of §7 the concrete form of the solution was not taken 
into account, and it was only assumed that the fundamental system of 
solutions consisted of mutually conjugate solutions and real solutions. By 
the same reasoning, therefore, in the case of multiple roots we have the 
following general rule: In the fundamental system (3) it is necessary that 
each pair of complex conjugate solutions be replaced by the real and 
imaginary parts of one of these solutions. The system of functions so 
obtained is the fundamental system of real solutions. 

4, Let 


te tTert 


be two complex conjugate solutions of (3). In the case of a real solution z, 
that part of the sum (4) corresponding to these solutions may be written 
in the form 

= ct eu tint + zie eM 


If we set 


we shall have 
2 = pte’ cos (vt + a). (14) 


In this way it is possible to replace every pair of complex conjugate 
solutions appearing in (4) by a real function of the form (14) containing 
two arbitrary real constants p and a. Here again, as in Example 3 of §7, 
it is evident that if a root \ has a nonzero imaginary part v, the solution 
has an oscillatory character, and if the real part » of \ is nonzero, then the 
solution either increases (for u > 0) or decreases (for wp < 0). Finally, 
if the root \ is multiple, there appears an additional term ¢”, which causes 
a further increase of the solution; however, as t — oo in the caseyw < 0, 
the increase of the solution caused by factor ¢” is considerably less than the 
decrease caused by factor e”’, so that for » < 0 (and for any order of 
multiplicity of the root) the solution tends to zero as ¢ increases. 
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5. Using the results of Examples 3 and 4 we can write all real solutions 
of the equation considered in Example 2 in the following two forms: 


z= (a! + at) cost + (b! + bt) sin f, 
= p; cos (t + a,) + pet cos (t + ag). 


n 


9. Stable polynomials. Let 
L(p)z = 0 (1) 


be a linear homogeneous equation with constant coefficients. The question 
of how the solutions of this equation behave as t — ++oo (whether they 
tend to zero, remain bounded, or increase without limit) plays a very 
important role in a whole series of applications of the theory of ordinary 
differential equations. In Examples 3 of §7 and 4 of §8 it has already been 
noted that this question of the behavior of solutions of equation (1) is 
related to the nature of the real parts of the roots of polynomial L(p). We 
shall now formulate this problem more precisely. 

(A) The polynomial L(p) is called stable if all its roots have negative 
real parts or, in geometrical terms, are located on the left-hand side of the 
imaginary axis in the complex plane. Let 


Ay = wy + Y;, j=il,...,m 


denote all the roots of the polynomial L(p). If this polynomial is stable, 
there exists a positive number a such that 


Lj < —a, j=il,...,m. (2) 


We shall show that in this case for every solution g(¢) of equation (1) a 
positive number M can be found such that 


le(t)| < Me™ for t > 0. (3) 


This formula not only shows that every solution of equation (1) tends to 
zero as t — oo, but also gives us an estimate of the rate of convergence 
to zero. 

We shall prove (8) first for an arbitrary fundamental solution z,, s = 


1,..., 7, of equation 1 [see (3), §8]. We have 
z,= te! whence 8) tei tere, 
eat 


Since the number yu; + a is negative because of (2), the function te%**! 


tends to zero ast — o and therefore is bounded fort > 0. Thus we have 


I~ <M, for t>0, 


e—at 
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or, what is the same thing, 


lz] < M,e—** for t> 0. 
If now 
o(t) = clzy + cPzg +--+ + 0%q 
is an arbitrary solution of equation (1), then for t > 0 we have 
le(t)| < (jet] +My + fe?|- Me +--+ + |e"|-My)e! = Me™. 


Thus the inequality (3) is proved. It should be noted that if even one 
of the roots \; of L(p) has a positive real part u; > 0, then there exists a 
solution e*s' of equation (1) which increases without bound as t > o. 

Many studies by mathematicians have been devoted to an applicable 
formulation of stability conditions of polynomials. For second-degree 
polynomials the stability condition is derived directly from the solution 
of a quadratic equation [see (B)]. The stability problem for polynomials 
of an arbitrary degree n was solved in several different forms by the 
mathematicians Rauss and Hurwitz. The Rauss-Hurwitz conditions, 
however, are inconvenient for actual computation, and, therefore, the work 
on new formulations of stability conditions continues. Here, a proof of the 
Rauss-Hurwitz criterion for n = 3 will be presented, and the stability 
condition for an arbitrary degree n will be given in the Hurwitz form with- 
out proof. 

(B) A second-degree polynomial L(p) = p?+ap+ ) with real 
coefficients a and b is stable if and only if its coefficients are positive. 

This assertion is easy to verify with the aid of the formula for the solu- 
tion of a quadratic equation. 

(C) If the polynomial L(p) = p” + ayp"—! + +--+ a_ with real 
coefficients is stable, then all its coefficients are positive. 

For the proof, we shall factor L(p) into its first- and second-degree fac- 
tors, i.e., into factors of the form p + cand p? + ap + b. Since L(p) is 
stable, each of its factors in this form is also stable. Tor the stability of the 
factor p +c it is necessary that the number c be positive, and for the 
stability of the factor p? + ap + b it is necessary that both a and b be 
positive. Since the coefficients of the factors are positive, it follows that the 
coefficients of the product are also positive. 

The following theorem gives a stability criterion for third-degree poly- 
nomials. 


THEOREM 6. The polynomial 
L(p) = aop? + arp? + asp + a3, a> ‘0,7 


with real coefficients is stable if and only if the numbers 4), ao, @3 arc 
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positive and, in addition, the inequality 


Q1Q2 > G03 


is satisfied. 
Proof. Vor the proof we shall examine the polynomial 
L(p) = p® + ap” + bp + ¢; (4) 


the case of the general polynomial L(p) can be easily reduced to this. By 
proposition (C) it is sufficient for us to prove that the polynomial (4) with 
positive coefficients a, b, c is stable if and only if the inequality 


ab>c (5) 


is valid. In our proof we shall use the fact that the roots of the polynomial 
are continuous functions of its coefficients. 

First. of all we shall determine those conditions for which the poly- 
nomial (4) has purely imaginary roots, in particular, the root p = 0, 
which must also be considered purely imaginary since it is located on the 
imaginary axis. We have 


L(p) = (p + a)(p? + b) — ab+e. (6) 


If L(p) has the root 0, then c = 0, and this case is excluded by hypothesis, 
since c > 0. Let us assume that the number tw, where w ¥ 0, is a root 
of L(p). If it is further assumed that —w? + b is not zero, then the 
number (tw + a)(—w? + b) has a nonzero imaginary part and cannot 
cancel the real number —ab + c. Thus the number zw can be a root of 
the polynomial L(p) only when —w? + b = 0; in this case we have the 
equality 
Liiw) = —ab+c=0. 


Conversely, if ab = c, then by formula (6) the polynomial L(p) has purely 
imaginary roots p = +7\/b. Thus L(p) (with positive coefficients) has 
purely imaginary roots if and only if a) = c. In particular, by a continuous 
change of the positive coefficients a, b, c, a root of polynomial L(p) can 
intersect the imaginary axis only if the equality ab = c is satisfied. 

Let us assume that (5) is not satisfied. Then either ab = c or ab < c. 
In the first case, the polynomial L(p) has purely imaginary roots and, 
consequently, is nonstable. We shall show that in the second case, i.e., 
when the inequality 

ab <c, (7) 


is satisfied, polynomial L(p) is also nonstable. We shall vary the coefficients 
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a and b continuously, keeping them positive in such a way that they tend 
to zero without violating inequality (7). During this change no root will 
pass from one side of the imaginary axis to the other, and consequently 
the property of stability or nonstability of the polynomial remains in- 
variant. When a = b = 0, we obtain the polynomial p? + c, whose 
roots ~/c (cos (7/3) + isin (7/3)) are located on the right-hand side of 
the imaginary axis. Since the roots depend continuously on the coefficients, 
the nonstability (that is, the presence of roots to the right of the imaginary 
axis) is also retained for sufficiently small positive a and b. 

Let us now assume that the inequality (5) is satisfied and show that 
L(p) is stable. For this, we shall vary the coefficient c in such a way that 
it tends to zero through positive values without violating (5). For c = 0, 
we obtain the polynomial 


L(p) = p(p? + ap + b), 


which has one zero root and two roots with negative real parts. Tor a 
small positive ¢ these two roots will not change very much, so that their 
product will remain positive and the zero root will assume a small positive 
or negative value. Since the product of all three roots equals the negative 
number —e, that root close to zero will be negative. Thus Theorem 6 is 
proved. 

In order to formulate the necessary and sufficient stability conditions 
for an arbitrary polynomial with real coefficients, we shall first fix our 
terminology. Let 


Pit Pi2 «+--+ Pin 
P= Pai p22 +--+ Pan 
Pnt Pn2 +++ Pnn 


be an arbitrary square matrix of order n. We shall call the determinant of 
the matrix 


Pil Pi2 +++ Pik 
P21 Po2 --- Pak 
(per Pro «++ Dek 


its principal kth minor, and denote it by A;,(P). Thus the determinant 
A,(P) consists of those elements of the matrix P appearing in the first 
k rows and columns. 


THEOREM 7. Let 


agp” + ayp"—| +--+ + an, ao > 0, (8) 
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be an arbitrary polynomial of degree n with real coefficients. In order 
to determine the stability of (8), the nth-order matrix 


O ... «2. Gnig Gn 


is formed. Then, the polynomial (8) is stable if and only if all princi- 
pal minors A;(Q), k = 1,..., 7, of Q are positive.* 


For further clarity, we shall describe the form of the matrix Q. The 
kth column of Q has the form 


++ Qk42 Q@k41 ApAp—1dp_2.-.., 


where the element a, lies on the principal diagonal; here, any element 
a4 ;, Whose index k + j is negative or larger than n, is taken to be zero. 


EXAMPLES 


1. We shall derive Theorem 6 from Theorem 7. In the case n = 8, 
the matrix Q has the form 


a, a3 0 
ap ag 0 
0 a, a3 


Its three principal minors have the values 
Ai(QQ) = a1, = A2(Q) = aya2 — aoag, A3(Q) = ag - A2(Q). 
The conditions that these values and the coefficient ap) be positive are 
a > 0, a, > 0, a3 > 0, 142 > 4003. 


From this set of conditions it clearly follows that coefficient ag is positive. 
Thus in the case n = 3 Theorem 7 reduces to Theorem 6. 


* Theorem 7 will not be proved completely in this book. For a proof of this 
theorem, together with a comprehensive discussion of this question, see Fuchs, 
B.A., and Levin, V.I., Functions of a Complex Variable, Vol. II, pp. 264 ff., 
Reading, Mass.: Addison-Wesley, 1961. 
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2. In the case n = 4, the matrix Q has the form 


a, 43 0 0 
Qo Go a4 0 
a, a3 0 
QO ado do a 


Its principal minors have the following values: 


Ai(Q) = 41; 

A2(Q) = a1d2 — ads; 
As(Q) = a3 Ao(Q) — ajay; 
A4(Q) = a4 A3(Q). 


The conditions that these minors be positive, together with the condition 
ado > 0, are obviously 


a > 0, a, > 0, a, > 0, a3 > 0, a, > 0, 
A3(Q) = a14243 — a9a3 — aja, > 0. 


10. The linear nonhomogeneous equation with constant coefficients. 
Here we shall give the solution of a linear equation with constant coefficients 
containing a free term of a special form which is the so-called quasipoly- 
nomial. 

(A) We define a guastpolynomial to be any function F(é) which can be 
written in the form 


F(t) = filt)es’ + foe! +--+ + fr(the’, (1) 


where )1, Ae, ..., An are complex numbers, and f,(é), fo(t), ..., fm(é) are 
polynomials in ¢. From proposition (C) of §8 it follows that every solution 
of a linear homogeneous equation with constant coefficients is a quasi- 
polynomial. Conversely, it can be proved that every quasipolynomial is 
a solution of some linear hornogeneous equation with constant coefficients. 
If any two numbers of the sequence d), Az, -.., Am coincide, for example 
if \y = Xe, then the terms of the sum (1) corresponding to these numbers 
can be combined and replaced by the term (f,(¢) + fo(t))e™’. Thus the 
expression (1) can always be reduced to a form in which the numbers 
Ay, A2,-.-, Am are mutually distinct. Let us note that the sum and product 
of two arbitrary quasipolynomials are also quasipolynomials; further, if 
we apply an arbitrary operator L(p) to an arbitrary quasipolynomial, then 
we shall again obtain a quasipolynomial. 
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Thus in the present section we shall study the equation 
L(p)z = F(8), (2) 


where F(t) is a certain quasipolynomial. Along with equation (2) we shall 
study the corresponding homogeneous equation 


L(p)u = 0. (3) 


The following proposition follows directly from (B) of §6. 
(B) If 2 is some solution of equation (2), then an arbitrary solution z of 
this equation can be written in the form 


z=é2+4, 


where wu is a certain solution of equation (3). 

Since we already know how to find an arbitrary solution of a homogene- 
ous equation, the problem reduces to finding one solution, or, as it is called, 
a particular solution, of equation (2) in the case when F(t) is a quasi- 
polynomial. Furthermore, since every quasipolynomial may be written in 
the form (1) by virtue of (C) of §6, the problem becomes one of determin- 
ing a particular solution of equation (2) in the case when F(t) = f(t)e™, 
where f(t) is a polynomial. For this case, the answer is given by the follow- 
ing theorem. 


THEOREM 8. Consider the nonhomogeneous equation 
L(p)z = f(je™, (4) 


in which f(é) is a polynomial of degree r in t, and \ is a complex number. 
Let k = 0 if L(A) ~ 0, and let k be the multiplicity of the root \ if 
LQ) = 0. Then there exists a particular solution of equation (4) of 


the form 
z= t*g(te™, (5) 


where g(t) is an rth-degree polynomial in t. The coefficients of g(é) can 
be found by the method of undetermined coefficients. 


Proof. We set 
S(t) = aot” + f*(t) (6) 


and look for a polynomial g(t) in the form 
g(t) = bot” + g*(t), (7) 


where the polynomials f*(t) and g*(t) are of degree r — 1. Further, we 
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may choose the number k in such a way that 
L(p) = M(p)(p — »)*, (8) 


where M(\) ¥ 0. In order that the function (5) be a solution of (4), the 
condition [see §8, (A)] 


L(p)eMt*g(t) = eL(p + d)t*g(t) = eMf(t) 
must be satisfied, i.e., g(t) must satisfy the condition 
L(p + tg) = fl). (9) 


The polynomial M(p + A) has as its free term the number M(A) ~ 0, 
and hence may be written in the form 


M(p +) = MQ) + M*(p)p, = MA) # 0. (10) 


Taking into account (6), (7), (8), and (10), we can write condition (9), 
as it is applied to g(é), in the form 


boM (A)p*t*t" + boM*(p)p ttt" +. Lip + Adttg*(t) = aot” + f*(t). 
(11) 


By equating the terms containing ¢” in (11), we obtain the relation 
boM(a)p*t**" = aot’, (12) 


from which the coefficient by of the unknown polynomial g(t) is uniquely 
determined since M(\) ¥ 0. We shall now assume that bo is already 
chosen, so that (12) is satisfied; then (11) takes the form 


L(p + Ajt*g*(t) = f*() — boM*(p)p*tlekt’, (13) 


where the right-hand side consists of a known polynomial of degree r — 1, 
while the left-hand side consists of an unknown polynomial g*(t) of 
degree r — 1. Equation (13) differs from (9) only by the fact that the 
degrees of the polynomials in it have been decreased by one. By repeating 
for equation (13) the calculations which were carried out earlier for equation 
(9), we calculate the coefficient b, of the term of highest degree in ¢ [in 
this case, the (r — 1)st degree] of the polynomial g*(t). By carrying out 
this process further, we can calculate all the coefficients bo, b;, ..., 0, of 
g(t) in such a way that (9) is satisfied, and, in the same way, we can finda 
solution of (4) in the form (5). 

It would be possible to substitute a solution of the form (5) directly 
into (4) and, by assuming that the coefficients of g(t) are unknown, to 
obtain for these coefficients a system of linear equations by equating the 
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coefficients of identical terms in the left-hand and right-hand sides of 
equation (4). The calculations carried out above show that the system 
of equations obtained for the coefficients of the polynomial g(#) is solvable. 
Thus theorem 8 is proved. 

Note. The system obtained for determining the coefficients of g(t) is a 
system of linear equations with a triangular matrix: By equating the 
coefficients of t’e™’, we obtain an equation containing only bp; by equating 
the coefficients of t”—'e™, we obtain an equation containing only bo and 
b,; and so on. 

We now establish an important property of quasipolynomials. 

(C) If the quasipolynomial 


F(t) = filt)e' + foes’ +--+ + fin(thern’, 


where Aj, Ag, ..-, Am are mutually distinct numbers which are identically 
zero on a certain interval r; < ¢ < re, then all the polynomials f)(¢), 
fo(t), ..., fm(t) are identically zero, so that all coefficients of the quasi- 


polynomial F(t) are zero. From this it follows that if two quasipolynomials 
F(t) and F*(é) are identically equal on the interval 7; < t < re, then 
their corresponding coefficients coincide. 

We shall prove proposition (C) by induction on the number m, which 
we shall call here the order of F(t). For m = 1 proposition (C) is valid, 
since in this case the equalities F(t) = f\(ée"’ = 0 and f\(t) = 0 are 
equivalent. We shall now carry out the induction step from m — 1 to 
m(m > 2). If F(t) is identically zero on the interval r} < t < re, then 
this is also true for the quasipolynomial 


Gi) = p(Fem'), 


where p is a differentiation operator and / is the degree of f,,(é). By propo- 
sition (A) of §8 we have 


G(t) _— gilt) 1 >m)é + go(t) e 2m) t ferret Qm—1(t) eOm—1 mt 


where 
gilt) = (p +i — Am) TY), = 7 = 1, m — 1. 


The quasipolynomial G(f) is of order m — 1, and, since it is identically 
zero on the interval 7; < ¢ < ro, it follows from the induction hypothesis 
that all the polynomials g;(é), ..., Ggmn—i(t) are identically zero. Let us 
suppose that one of the polynomials f,(), ..., fm—1(t) is not zero, and 
then show that this assumption leads to a contradiction. Let us assume 
that fi(t) is of degree k, ie., that f(t) = aot* + a)! 4+---+ a, 
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where ad) ~ 0. By direct inspection we see that 
g(t) = (P+ 1 — Am) Ff) = Ar — Am)! Frat +, 
and since g,(t) is identically zero on the interval r; < ¢ < re, we have 
(41 — Am)'t ag = 0. 


Since the numbers A, and X,, are distinct, it follows that ag = 0. This 
contradiction shows that all coefficients of the polynomials f,(é), ..., 
fm—1(t) are equal to zero, i.e., F(t) = fm(t) ed’. Hence we conclude that 
all the coefficients of f,,(¢) are also zero. 

The case of the identity of two quasipolynomials F(t) and F*(z) on the 
interval r; < ¢ < rg is reduced to the case already studied by forming 
the quasipolynomial F(t) — F*(t). Thus proposition (C) is proved. 

EXAMPLES 
1. We shall find a particular solution of the equation 
Z+2=tcost = He’ + de“, (14) 
We solve separately the equations 
gt2=— Me, (15) 
Z4+2= hte“. (16) 


It is evident that, if z is a solution of (15), then Z is a solution of (16). 
Thus it is sufficient to solve only equation (15). Herer = 1,4 =7,k = 1. 
Therefore a particular solution must be found in the form 


t(c! + ete. 
The relation (9) takes the form 


((p + 2)? + I(clt + c7t?) = 4B, 


or 
(p? + Qip)(c't + ct?) = de. 
This gives 
Qe? + 2ic! + 4ic*t = Ht, 
whence c? = —}i, c' = ic? = 4. Thus the particular solution of (15) 


has the form 


1 t i 
z= (fe geyer, 


11] METHOD OF ELIMINATION 67 


and the solution of (14) is found to be 


2 
cost + Z sin ¢. 


a] oe 


5 — lit —ity , Lape iy 
z+z= gle +e )+ ge ej) = 
2. We shall study the function 
f(t) = cos 2t - cos 3¢ - e*. 


Since every factor cos 2t, cos 3t, e*' is a quasipolynomial, their product f(t) 
is also a quasipolynomial. Let us reduce this quasipolynomial to the 
form (1): 

erit 4 goBit Bit 4 git 

— 2. 2 °¢ 


_ (445%)¢ 1,(4+7)t (4—1)t (4—57)t 
= qe + 4e + ge + qe a 


cos 2t- cos 3t+ e*! = 


The reduction of polynomials to form (1) is useful in the solution of 
nonhomogeneous equations on the basis of Theorem 8. 


11. Method of elimination. Until now we have been occupied with the 
solution of only one linear equation with constant coefficients. However, 
a quite general system of linear equations with constant coefficients may, 
in a certain sense, be reduced to one equation. This reduction is realized 
by the method of elimination, which is similar to that used in the theory of 
algebraic (not differential) linear equations. We shall explain this method 
here and then draw certain conclusions. 

We shall investigate the system of equations 


DY Lip)e =f",  §=1,...,, (1) 
s=1 
where x!, ..., 2” are unknown functions of the independent variable 1, 
and f'(),...,f"(é) are given functions of time é. Each symbol L2(p) repre- 


sents a polynomial with constant coefficients with respect to a differentia- 
tion operator p, so that one term, L/(p)z°, represents a linear combination 
with constant coefficients in the function x and its derivatives. The num- 
ber of equations in system (1) is equal to the number of unknown functions. 

The order of the system (1) with respect to the unknown function 2° is 
denoted by q,, so that the genera] order of (1) is determined by the formula 
q=@+42+-::+ 4a. In posing the problem of solving (1), we 
naturally must assume that every unknown function z* has derivatives up 
to the order q, inclusive; no assumption on the existence of derivatives of 
higher orders is contained in the statement of the problem. 

In applying the method of elimination to (1), we assume that each of 
the unknown functions x’, along with each of the functions f?(t), has an 
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appropriate number of derivatives. In making these assumptions, we are 
narrowing down, on the one hand, the class of the solutions under con- 
sideration (assumption on the adequate differentiability of the unknown 
functions), and, on the other hand, we are narrowing down the class of the 
equations under consideration [assumption of adequate differentiability 
of the functions f()]. The first of these restrictions can be removed by 
proving that if x',..., x” is a solution of (1) and if the right-hand sides of 
f?(é) have an appropriate number of derivatives, then each of the func- 
tions x* has an appropriate number of derivatives (see Examples 3 and 4). 

Let us proceed to a description of the method of elimination. 

(A) Let us study the matrix 


Li(p) ... La(p) 
: (2) 
Li(p) ... Ln(p) 


of the system (1). Every element Li(p) of the matrix (2) is a polynomial 
in p. Thus it is possible to calculate the determinant D(p) of the matrix 
(2) and its minors. The cofactor of the element Li(p) of (2) (i.e., the minor 
of this element taken with the proper sign) will be denoted by M}(p). 
Trom any course in higher algebra it is known that the identity 


>. Mi(p)Li(p) = 8:D(p) (3) 
j=l 
is valid, where 6: is the so-called Kronecker symbol: 
ss = 1, = 0 for t #8. 


Multiplying equation (1) by Mi(p) (ie., carrying out a series of differ- 
entiations, multiplications and additions) and then summing up with 
respect to j, we obtain the equality 


y> Mip)Lipye’ = Mio. (4) 
j,s=1 J 


[In going from (1) to (4), we have used the differentiability assumptions 
imposed on the functions x* and f?(t)]. By virtue of (3) the equality (4) 
can be rewritten in the form 


D(p)x* = DY Mis’. (5) 


The system (5), 7 = 1, ..., », has the property that every unknown 
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function x’ appears in only one equation of (5). We have thus proved that 
if the system of functions z!, ..., 2” is a solution of (1), then each indi- 
vidual function z* is a solution of (5). 

It should not be supposed, however, that if for every number 7 a solution 
x’ of equation (5) is selected in an arbitrary manner and the system of 
functions x!,..., 2” is then formed, the system of functions obtained will 
be a solution of system (1). In order to find the general solution x!,..., x” 
of (1), it is necessary to find the general solution x* of every equation (5), 
i = 1,..., 7, then form the system of functions z', ..., x”, and finally 
determine those conditions (the relations between the constants of inte- 
gration) under which this system of functions satisfies the system (1). 

We shall now draw certain conclusions from the method of elimination. 
First we shall formulate the result obtained in proposition (A) for the case 
of the homogeneous system of equations 


n 


> Lip)t® = 0, j=1,...,7. (6) 
s=1 
(B) If the system of functions z!,..., x” is a solution of the system (6), 


then each individual function z* appearing in this solution satisfies the 
equation 
D(p)x* = 0, 


where D(p) is the determinant of the matrix (Li(p)) of the system (6). 
In particular, it follows from this that if the determinant D(p) is a stable 
polynomial [see §9, (A)], then every solution z!,..., x” of (6) satisfies the 
inequality 

(1)? +++++ (2)? < R%e** for ¢ > 0, (7) 


where @ is a positive constant depending on the system (6), and # is a 
constant depending on the solution z!,..., x”. 

The inequality (7) follows directly from inequalities (3) of §9. 

We shall now show how the system of equations (6) can be solved by 
using the elimination method. 

(C) Let us assume that the determinant D(p) of (6) does not vanish 
identically, and let A be a root of multiplicity k of D(p). We shall seek a 
solution of (6) in the form 


z= g*(te, s=1,...,%, (8) 
where 


g'(t),.--, 9°) (9) 


is a polynomial of degree k — 1. By substituting the functions (8) into 
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(6), we obtain [see §8, (A)] 


O= D> Lilp)g’ He’ = Do eMLi(p + dg), f= 1,.--, 05 
s=1 


s=1 


after factoring out e*! we have 


YS Liipt+ rd =0, F=h...,n (10) 


s=1 


Thus, the system of functions (8) is a solution of (6) if and only if the 
polynomials (9) satisfy (10). Each term 


Lip + »)g°(0) 


of the left-hand side of (10) is a polynomial in ¢, and its coefficients are 
linear homogeneous functions of the coefficients of the polynomial g*(t). 
Thus the set of conditions (10) is equivalent to a certain system of linear 
homogeneous equations in the coefficients of the polynomials (9). From 
the theory of linear homogeneous algebraic equations it follows that, by 
finding the coefficients of the polynomials (9) from this linear system, we 
may take a certain number of these coefficients as arbitrary and express 
the remaining ones in terms of them. (The case where the number of 
arbitrary coefficients is equal to zero is not excluded a priori.) The solu- 
tions (8) of the system (6) obtained in this manner will be called the 
solutions corresponding to the root A. It is clear that these solutions are 
defined for all values of t, —0 <t < o. 


THEOREM 9. Let us assume that the determinant D(p) of the system (6) 
does not vanish identically, and let 


hy, Nay ey Am 
be the set of all distinct roots of the polynomial D(p). Then an arbi- 
trary solution x!,..., 2” of (6) can be written in the form 
eaate:- +a, s=1,...,”, (11) 
where 
vi,a?,...,2% 


is some solution of system (6) which corresponds to the root A, [see (C)]. 
Hence, in particular, it follows that every solution of (6) is defined for 
all values of ¢. 
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Proof. Let us assume that 


is a solution of (6) which is defined on the interval ry; < t < re; we shall 
show that on this interval it can be written in the form (11). By proposi- 
tion (B) every function 2° on the intervalr; < ¢ < r2 satisfies the equation 


D(p)x* = 0, 


so that on the same interval by proposition (C) of §8 it can be written 
in the form 


™m 
r= > gi(tyer*?, s=1,...,”, (12) 
i=l 


where g(t) is a polynomial whose degree is less than the multiplicity of 
the root \;. By substituting the system (12) into (6), we obtain [see §8, (A)] 


0= SY Looe => (e > Lip + rato), 
s=1 


s=1 i=1 t=1 


j=l1,...,n. (13) 


Since ©, Li(p + 2,)g%(t) is a polynomial in ¢ and the numbers )y,..., 
Am are mutually distinct, it follows from proposition (C) of §10 and from 
(13) that 


>> Lilp + dg) = 0; t=1,...,m, f=l,...,n. (14) 
s=1 
Multiplying (14) by e*#', we obtain 
0 = Doe Lip + rAdg) = DY Lalp)gie™, 
s=1 


s=1 


which shows that 


a= ge, s=1,...,n, 
is a solution of the system (6) for each z = 1,..., 7. Thus Theorem 9 is 
proved. 
EXAMPLES 


1. We shall solve by the elimination method the system of equations 


B+oet +e =0, #-—2t'+#?+2°7=0. 
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We rewrite this system symbolically as 
(p + 1)x* + px? = 0, 
(p? — la! + (p? + 1)x? = 
The determinant of the system, as is easily seen, is equal to p? + 2p + 1; 


it has a double root \ = —1. According to Theorem 9, the solution of the 
system should have the form 


a} 


(at + b)e~*, 
(ct + d)e* 


a? 


Substituting these functions into the first equation (and factoring out e—‘), 
we have 
at+e—c—d=0O, 
whence 
c= 0, 
a=d 


The same relations for the coefficients may also be obtained by substitution 
into the second equation of the system. Thus the general solution of the 
system under consideration may be written in the form 


zi = (at+ be, 


x? = ae, 


where a and b are arbitrary constants. 
2. We shall apply the elimination method to the normal system of 
linear homogeneous equations with constant coefficients, 


n . 
P= SS aie’, j=1,...,%, (15) 
s=1 


(such a system will be studied more thoroughly in §14). We rewrite the 
system (15) by using the symbolism 


n . 
I 8 M 
=) a;r, j= l,...,”, 
s=1 


or 


n 
2 (a a—pe)r®’=0, j=l,...,n, (16) 


where 6? is the Kronecker symbol. The system (16) is a special case of the 
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general system (6), in which 

Li(p) = a3 — pds, 
and the determinant D(p), in this case, is the characteristic determinant 
of the matrix (a2) of the system (15). The solution of (16) must now be 
obtained by the method of undertermined coefficients described in Theo- 


rem 9. In the particular case when the roots d,,..., An of D(p) are simple, 
there corresponds to every root A; the solution 


ai = git! s=1,...,%”, (17) 
where 


a (18) 


are polynomials of zero degree, i.e., numbers. Substituting the functions 
(17) into (15), we obtain for the undetermined coefficients (18) the system 


of linear equations 
n 


igi = D> alg’, j=l,...,n, 


s=1 


which shows that the numbers (18) are the components of the eigenvector 

of the matrix (a3) with eigenvalues A,. Since all eigenvectors corresponding 

to the same eigenvalue A; are proportional in the case of simple roots, then 

by denoting the components of any eigenvector with eigenvalues \; by 
hi, hi, sre hi, 

we obtain 


8 ips 
gi = chi, s=1,...,n. 


Thus the general solution of (15) in the case of simple roots may be written 
in the form 


nr 
oa hk 
a= >> chic! s=1,...,%”, 


i=] 


where c!,..., c” are arbitrary constants. 
3. We shall study the linear system 


Yi Lp)" =f), § = Aye, (19) 
s=1 


with constant coefficients [see (1)]. Let g; be its order with respect to the 
unknown z’ and let 


q=Hutdates* +4 
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be the order of the system (19). Further, let 


Li(p) ... Ln(p) 
( (20) 

i(p) .-. Ln(p) 
be the matrix of (19) and D(p) its determinant. We shall show that the 
degree of the polynomial D(p) does not exceed the number gq. If this degree 


is equal to qg, then we shall call the system (19) normalizable. In this case 
it can be solved for the higher derivatives 


(x), 22. , (x), (21) 


and therefore it can be reduced to a normal system [see §4, (B)]. 
By assumption, the degree of the polynomial Z7(p) does not exceed the 
number qg., SO we can write 


Li(p) = aip +---, (22) 


where the dots denote terms of degree lower than qg,. By calculating the 
determinant D(p) of (20), while taking into account formula (22), we 
have at once that 

Dip) = Api +---, 


where A is the determinant of the matrix (a’). In this formula we have 
omitted terms of degree lower than g. Thus we have established that the 
maximum possible degree of the polynomial D(p) is q, and if this degree is 
equal to gq, then A ~ 0. If we select those terms in (19) with the highest 
derivatives (21), we arrive at the system 


> ale) +... =f), jFod,...,n (23) 


s=1 


Thus, if (19) can be normalized, then A ~ 0 and the system (23) is solva- 
ble for highest derivatives (21). 

Since the normalizable system (19) reduces to a normal one, then, by 
what was said in Example 3 of §3, every solution of (19) has an arbitrary 
prescribed number of derivatives if only the right-hand sides f#(#) of (19) 
are differentiable an appropriate number of times. 

4, We shall now study the case when the determinant D(p) of the sys- 
tem (19) is not identically zero, but the degree of D(p) is less than the order 
q of the system (19). We shall show that even in this case every solution of 
(19) has any prescribed number of derivatives, if only the right-hand sides 
f?(@) are sufficiently differentiable. 
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By hypothesis, the degree of D(p) is less than q, so that the determinant 
Ais zero. Thus, the columns of the matrix (a) are linearly dependent; let 


b!, ..., b” be the coefficients expressing this dependence. Some of the 
numbers b',..., 6” may be zero. We shall change the numbering of the 
functions z',..., 2” so that the relations 
b! = 0, b? ~ 0, ..., 0 #0, pmti—... = oP = 0, 
l<m<n, (24) 
M292, 1 2 43, ---, M1 2 Im, 
are valid. Since by (24) we have b! ¥ 0, we may assume that 6! = 1. 
We shall now replace the indeterminates x’, ..., 2” by new indetermi- 
nates y!,..., y” by setting 
aa yts ahs yt bipunty!, i = 2,...,m; 
gay, ti=mti,...,n. (25) 
The relations (25) can be solved for the new indeterminates y!, ... , y”, 
namely, 
yaa; y= at— bipntz!, 4 =2,...,m; 
y=, t=m+1,...,n. (26) 
Substituting the new indeterminates y!, ..., y” into (19) in place of 


1 


xz*,..., 0", we obtain the new system of equations 


LH py =f, §=4,-..)0. (27) 
s=1 


It is immediately evident that the order g7 of the system (27) with respect 
to the function y! is less than g,, and its orders with respect to the remain- 
ing unknowns y”,..., y” are correspondingly equal to q2,..., 4. Thus 
the order q* of (27) is less than the order gq of (19). 

If we consider the transformations (25) and (26) as linear transformations 
of the variables y', ..., y” into the variables z!,... , x”, and conversely, 
with coefficients which are polynomials in p, then it is evident that the 
determinant of each of the linear transformations (25) and (26) is equal to 
+1. From this it follows that the determinant D*(p) of (27) is equal to 
the determinant D(p) of (19). Thus the difference between the order and 
the degree of the determinant in system (27) is less than in system (19); 
by applying the transformation described a finite number of times, we 
obtain a normalizable system. 


Now let ; 
x= ¢'(t), t= 1,...,n, (28) 
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be some solution of the system (19). Since the order of (19) with respect to 
the unknown 2’ is equal to qg;, the function ¢’(é) may be assumed to be 
differentiable g; times. By virtue of the transformation (26) the solution 


y = v(), t= 1, SEL 2) (29) 


of (27) corresponds to the solution (28) of (19). From (26) it is evident 
that y’(t) is differentiable g; times. Consequently, from every solution (28) 
of (19) we can obtain a certain solution (29) of (27), so that no solution is 
lost in going from (19) to (27). Since, after a series of transformations, we 
arrive at a normalizable system whose solutions have any given number 
of derivatives, it is evident from (25) that a solution (28) of (19) also has 
any prescribed number of derivatives. 


12. The method of complex amplitudes. In many branches of engineer- 
ing and physics dealing with oscillating processes, harmonic oscillations 
play an important role. Mathematically, a harmonic oscillation is defined 
by a function 

r cos (wt + a), r> 0. (1) 


Here r is the amplitude of the oscillation, a is its initial phase, and the 
number w, determining the frequency of the oscillation, is usually called 
the frequency. Actually, if v is the number of oscillations per second, then 


w = 20, 


so that w is the number of oscillations in 27 seconds, not in one second. 
We also noticed (see Example 1, §4) that the equation 


E+ wr = 0 (2) 


has as its general solution the harmonic function (1) of frequency w with 
an arbitrary amplitude and phase. Equation (2) is called the equation of 
the harmonic oscillator. 
In the study of harmonic oscillations it is often necessary to deal with 
the equation 
L(p)x = rcos (wt + a), (3) 


where the right-hand side contains a harmonic function. Equation (3) 
is easily solved by using the method set forth in Theorem 8, since the 
harmonic function is a quasipolynomial. In the case when the coefficients 
of the polynomial L(p) are real, Theorem 8 can be utilized in a somewhat 
different way. In electrical engineering this method is called the method 
of complex amplitudes and is explained in what follows. 
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(A) In conjunction with the real harmonic function (1) we shall study 
its corresponding complex harmonic function 


pe, (4) 
where 
p = re. (5) 


The function (4) has the property that its real part coincides with the 
function (1): 


pei? = ret) — ¢ cos (wt + a) + irsin (wt + a). 


The complex number (5) is called the complex amplitude of the complex 
harmonic function (4); it combines the amplitude r and the initial phase a. 
Let us note that 

r = (pl. 


Tn the case that the coefficients of L(p) are real, the equation 


L(p)z = pe’ (6) 


is solved as a preliminary to the solution of equation (3). It is immediately 
evident that, if z2 = x + zy is a solution of (6), then z is a solution of (3). 
Assuming that zw is not a root of L(p), 


L(iw) ¥ 0, (7) 


we seek (see Theorem 8) a solution of equation (6) in the form of a complex 
harmonic function z = oe*”’ with complex amplitude o = se”. By sub- 


twe 3 


stituting z = ge*”’ into (6) we obtain 


_ _?p 


[see §7, (B)]. Thus a solution of (8) is given by 
x = scos (wt + 8). (9) 


The amplitude s and initial phase @ of this solution are determined from 
the formula ; 
se — TO 
L(iw) 
[see (8)]. In particular, s = |o| = r/|L(@w)|. If the polynomial L(p) 
is stable, then (7) is obviously satisfied. In this case the general solution 
of equation (3) has the form 


x= u+scos (w+ 8), (10) 
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where wu is the solution of the homogeneous equation L(p)u = 0. The 
solution u of this homogeneous equation tends to zero as t > oo, and 
therefore any solution of (3) tends to the solution (9). The solution (9) 
is called stable and it corresponds to a steady-state process, while the solu- 
tion (10) describes a transient process. The stable solution (9) is the unique 
periodic solution among all solutions (10). 

In applying the method of complex amplitudes, we usually do not study 
the solutions of the real equation (3); instead, we proceed directly from 
the complex equation (6). 

We shall now present the method of complex amplitudes as it is applied 
to a system of equations. The problem is to find a particular solution of 
the system of equations 


> Li(p)a® = r cos (wt +a’), j=Hl...4m (11) 
s=1 
with real coefficients, whose right-hand sides are harmonic oscillations of 
the same frequency w. 
(B) We shall assume that the determinant D(p) of the system (11) 
[see §11, (A)] does not vanish at p == tw. To find the solution of (11), we 
shall first find the solution of the system of equations 


Ds Lipa = pe, GF = 1.4m (12) 
k=1 


where . 
. ar, 
p) = riel@, 


Since the coefficients of all the polynomials Lj(p) are real, we shall obtain 
the following solution of (11) from any solution z', ..., 2” of (12): 


z* = Re2z*, k= 1,...,m”. 
We find the solution of the system (12) in the form 
ze ok! k= 1,...,n. (13) 


By substituting the functions (13) into (12) and factoring out e*’, we 
obtain the system of equations 


n - 
dy Liliw)o* = p’, 
k=1 


which can be solved uniquely for the unknowns o*, because its determinant 
D(iw) is assumed to be different from zero. We shall find solutions of this 
system and assume that 


ook 
ok = ght. 


12] THE METHOD OF COMPLEX AMPLITUDES 79 


so that by (13) we obtain the solution 
z* = s* cos (wt + 8"), k=1,...,n, (14) 


of (11). If the determinant D(p) of (11) is a stable polynomial, then the 
inequality D(iw) 0 is satisfied, and in addition every solution of (11) 
differs from the solution (14) by a term which tends to zero as t > 0 
[see §11, (B)]. Thus in the case of a stable polynomial D(p), the solution 
(14) of (11) not only consists of the particular solutions, but is itself a 
stable solution. 


EXAMPLE 
We shall solve the equation 
# + wir = rcos (wt + a) (15) 


of the harmonic oscillator which is subject to an external harmonic force. 
Instead of (15) we shall study the corresponding complex equation 
E+ wie = reort. (16) 
If w * w,, then (16) has a solution of the form z = oe“, so that by (8) 
ree 
°= Be 


Thus (15) has the solution 


— 


cos (wt + 8B), (17) 


|w? — w?| 


where 8 = a for w; > w and 8 = a+ @7 for w, < w. Formula (17) 
gives the forced oscillations of the oscillator under a harmonic external 
force. Here it is important to note the phenomenon of resonance, which 
incorporates the fact that the amplitude 


fr 
|w? — w?| 


of a forced oscillation increases as the difference |w; — w| decreases. It 
is also interesting to note that the phase £ of the oscillation (17) coincides 
with the phase a of the external force for w; > w and is opposite to it for 
w, < w. The general solution of (15) may be written in the form 


x = 71 cos (wy + ay) + | cos (wt + 8B), 


a 
w? — w?| 
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where u = 7; cos (wit + a) is a solution of the corresponding homo- 
geneous equation. The term ux is called the natural oscillation of the os- 
cillator. 

If w; = w, then the formula (17) loses its meaning. In this case the 
solution of (16) must be sought in the form 

z= pte, 
where p is a complex number (see Theorem 8). By formula (9) of §10, we 
have 
[(p + tw)? + w?lpt = re’, 


whence 
ja 


_ ree 
P= Dia 


Thus for w; = w the particular solution of equation (16) has the form 


rieietta) rie et te—Gl2)1 
z=—35,, = ; 


2iw 2w 


and the solution of equation (15) proves to be 


rt T rt 
r= 2 cos (wt + a — z) = Dg) Sin (wt + a). 
Thus for w = w,, the phenomenon of resonance involves the fact that the 
amplitude ri/2w becomes variable and increases without limit with time. 
In a real apparatus this phenomenon cannot be observed due to the 
presence of “friction.” 


13. Electrical circuits. The theory of ordinary differential equations 
finds applications in various fields of engineering; it is applied in electrical 
engineering and in particular in radio engineering. With some idealization, 
the performance of a radio device can be mathematically described by a 
system of ordinary differential equations; the values of currents flowing 
through various elements of the device or the voltage drops between 
individual junctions of the device are the unknown functions of time in 
this system. Radio devices afford a very rich source of material illustrating 
the application of the theory of ordinary differential equations—much 
richer than, for example, the problems of mechanics. This richness is 
characterized in particular by the fact that a system of ordinary differential 
equations arising from a certain engineering problem often can be simulated 
by an electrical device, i.e., an electrical device may be designed whose 
performance is described by the same system of equations as the enginecr- 
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ing object in which we are interested. An electrical device so designed can 
help solve the system of equations, since in observing its performance we 
are at the same time observing the behavior of the unknown functions 
which satisfy the system of equations. The physical laws governing the 
performance of electrical devices are formulated so simply that they can 
be easily communicated even to a beginner in physics. Here, in a some- 
what dogmatic form, the simplest laws of electrical engineering are given 
along with several examples of the application of differential equations to 
the study of the performance of electrical devices. 

Resistors, inductors (self-induction), and capacitors (condensers) are 
some of the most important components from which electrical devices 
are designed. Each of these components is a two-ferminal element, i.e., has 
two contacts which in assembling an electrical device are connected to the 
terminals of other components. During the operation of an electrical 
device, an electric current passes through the two-terminal element 
installed in the device, and the electrical state of the two-terminal element 
at each instant ¢ is characterized by two values: the current Igp(t) which 
flows from the pole a to the pole b of the two-terminal element ab, and the 
voltage drop U,,(f) from pole a to pole b. The current J,3(¢) can take 
positive as well as negative values; if the current flows from pole a to pole b 
(keeping in mind the so-called technical direction of the current), then the 
number J,,(¢) is positive, and in the opposite case it is negative. The volt- 
age drop U,,(¢) from pole a to pole b is the difference V.(£) — V2(t) of the 
potentials at the pole a and b. Thus both values, J,:(¢) and U,(t), which 
describe the state of a two-terminal element ab at instant t, depend on 
which pole is put in the first place and which in the second. When the order 
of the poles changes, each of the values J,,(f) and U,»(t) obviously changes 
its sign, so that we have the relations 


Toa(t) = —TLeap(t), (1) 
Ura(t) = —Uas(t). (2) 


Tor every two-terminal element ab, the functions J,,(t) and U.s(t) of 
time ¢ are not independent, but are related to each other by physical laws 
governing the performance of a two-terminal element. [or resistance, 
inductance, and capacitance, these laws are given by the following 
proposition: 

(A) lor a two-terminal element ab representing resistance, the relation- 
ship (Ohm’s law) 

Uat) = Ravlar(t) (3) 


is valid; here Rap is a positive coefficient called the resistance, which 
for different two-terminal elements can take different values but which is 


82 LINEAR EQUATIONS WITH CONSTANT COEFFICIENTS [CHAP. 2 


constant for a given element; here we always have 
Roa = Rap. (4) 


For a two-terminal element ab which represents inductance, the relation 
d » 
Uartt) = Lava Tar(t) (5) 


is valid; here Lg, is a positive coefficient called the inductance which can 
take different values for different two-terminal elements but which is 
constant for each given element. Here 


La = Lap. (6) 


For a two-terminal element ab which is the capacitance (the condenser) 
the relationship 


Tas(t) = CoS Uar(t) (7) 


is valid, where Cap is a positive coefficient called the capacitance, which 
can take different values for different two-terminal elements but has only 
one value for a given element; here we have 


Cap = Coa. (8) 


If we integrate the relation (7), we obtain 


t 


Uasld) = Ueslte) + a [rato 0) 


to 
The function 
Qar(t) = CorUar(t) 


represents a physical quantity describing the state of the condenser at a 
given instant and is called the charge of the condenser ab. The relation (9) 
is often written in the form 


Uar(t) = a | Tav(t) dt, 


where fJ,,(t) dt represents the charge of the condenser. 
Relation (4) follows from (1), (2), and (3): 


Ravlas(t) = Uar(t) = —Uralt) = — Roalra(t) 
= Rsa(—Iba(t)) = Real av(é). 


The relations (6) and (8) are established in a similar way. 
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An important role in the performance of electrical devices is given to 
the phenomenon of mutual induction between two inductances. 

(B) Two inductances a,b, and adgbe with values Lab, = [, and 
Lao, = Lz can be in a state of mutual induction which is described by the 
coefficient of mutual induction, M = Moa,b,,a9b.. In this case the voltage 
drop Ua,5,(f) = U,(t) in a two-terminal element a,b, is related not only 
to the current J.,»,@) = J,(t), but also to the current J,,5,(f) = Jo(t). 
In the same way, the voltage U,,5,(£) = Ue(t) in a two-terminal element 
Agb2 is related not only to the current J,(t) but also to the current J,(é). 
The exact relations are given by the formulas 


Ui) = Lit <1) + us 7 fal), (10) 


Url) = Le al) + MEN. (11) 
In this case the equalities 
M a,b, ,00bs = Magbo,a1b1 = —M ayb,,bo02 


are valid for the coefficient of mutual induction M4,b,,0.b., a8 well as the 
inequality 
M? < IyLx. 


The greater the “interaction” of two inductances, the more closely the 
coefficient of mutual induction M approximates in magnitude the quantity 
V/1yL.. 

The two-terminal elements described in proposition (A) are called 
passive because they cannot themselves initiate an electrical phenomenon 
in a device. The active two-terminal elements which serve as the direct 
cause of electrical currents in a device are voltage sources and current 
sources. 

(C) In the two-terminal element ab which is a voltage source, the 
relationship 


Ualt) = U(t) (12) 


is valid, where U(é) is a given function of time ¢ describing a voltage source. 
Equation (12) may be considered as relating the functions U,,(¢) and 
I(t), except that this relation is such that the function J,,(f) does not 
appear init. For a current source ab, the relationship 


Tarot) = I(t) 


is similarly valid, where J(¢) is a given function of ¢ describing the current 
source. The voltage sources and current sources most often studied are 
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those for which U(é) and J(¢) are either constants or periodic functions of 


the form 
r cos (wt + a). 


These are the principal, and at the same time the simplest, components 
from which electrical devices are assembled. The devices themselves are 
called electrical networks, and the components from which they are as- 
sembied are called their elements. It should be noted that elements exist 
which are different from those described above; in particular, there exist 
multiterminal elements. An example of a three-terminal element is the 
electron tube (triode), whose performance will be discussed later (see §29). 

Kirchhoff’s laws. We shall now proceed to formulate Kirchhoff’s laws, 
which govern the performance of electrical networks. 

(D) An electrical network is defined as a finite set of elements (in particu- 
lar, two-terminal elements of the form described above) with poles that 
are connected in so-called “junctions” of the network so that at each 
junction two or more poles of the various elements of the network are 
joined. Kirchhoff’s first law asserts that the sum of all currents entering 
each junction of a network from all elements connected to this junction is 
equal to zero. Kirchhoff’s second law follows from the proposition that at 
each junction a of a network there is an electric potential V,(f), and the 
voltage drop U,,(é) from junction a to junction b is the difference between 
the potentials in junctions a and b, so that Ua (t) = Vat) — V(t). 
Irom this assumption it follows that, if a,b, c,...,h, kts a certain sequence 
of junctions in an electrical network, then the relation 


Ualt) + Usc(t) +--+ + Unx(t) + Une(t) = 0 


is valid. This relation is Kirchhoff’s second law. It is stated as follows: 
The sum of voltage drops around a closed circuit of a network ts equal to zero. 
We do not assume in these formulations of Kirchhoff’s laws that all the 
elements have two terminals. We shall now define more clearly Kirch- 
hoff’s laws as applied to networks composed of only two-terminal elements. 

(E) Let S be a certain electrical network composed of two-terminal 
elements. Kirchhoff’s first law states that, if ais an arbitrary junction of a 
network S, and if b,a, bea, ..., bga is the set of all two-terminal elements 
connected to the junction a (Fig. 12), then 


Tp, a(t) + Tyna(t) Sa Ty a(E) = 0. 


Kirchhoff’s second law states that if ab, bc, ..., hk, ka is a sequence of 
two-terminal elements in a network S (each successive two-terminal ele- 
ment starts at that junction at which the preceding one ends (Fig. 13)], 


then 
Uar(t) + Unelt) + +++ + Uns(t) + Unelt) = 0. 
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Figure 12 Figure 13 


To calculate the performance of an electrical network consisting of 
two-terminal elements we must find the current and voltage in each two- 
terminal element in the network; thus if the network consists of n two- 
terminal elements, we face the problem of finding 2n functions of time. 
The law governing the performance of each two-terminal element gives one 
relation between the functions sought so that by this law we obtain n 
relations for 2n unknown functions. The remaining 7 relations are given 
by Kirchhoif’s laws. (It can be proved that Kirchhoff’s laws give exactly 
n independent relations, but we shall omit this proof.) Asa result of using 
all the relations, we obtain a system of 2n equations for 2n unknown func- 
tions. These equations are partly differential and partly finite (alge- 
braic). Kirchhoff’s laws give finite equations which must first be used for 
eliminating part of the unknown functions. For this elimination one of 
the following two methods is usually employed. The first method relies 
on the fact that it is possible to take currents for basic unknown functions 
and express the voltages in terms of them. In this case it is first necessary 
to use Kirchhoff’s first law: to express all currents in terms of the minimal 
number of independent currents (by this law). Such independent currents 
are called loop currents. After this, Kirchhoff’s second law is to be used by 
replacing every voltage by its expression in terms of the corresponding 
current. This method is called the method of loop currents. In the second 
method the voltages on the two-terminal elements are taken as the basic 
unknown functions, and the currents are expressed in terms of these 
voltages (with the aid of laws governing the performance of each two- 
terminal element). In this case it is first necessary to express all voltages 
in terms of the minimal number of independent voltages by means of 
Kirchhoff’s second law. Independent voltages are called nodal voltages. 
Next it is necessary to use Kirchhoff’s first law by replacing every current 
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by its expression in terms of the corresponding voltage. This method is 
called the method of nodal voltages. 

Operational impedance of a two-terminal element. Before proceeding to 
the analysis of examples of electrical network calculation, we shall write 
relations (3), (5), (9), (10), and (11), ie., the laws governing the per- 
formance of two-terminal elements, in a symbolic notation. 

(F) Let ab be a two-terminal element representing resistance, inductance, 
or capacitance. Let us set 


Var(t) = U(t), Tap(t) = I(t), 
Ra = R, Loa = L, Ca = C. 


If in addition to the symbolic notation previously used [see §7, (A)], we 
introduce the natural notation (1/p)f(é) = ff(r)d7, then (3), (5), and 
(9) can be expressed by one formula 


U(t) = Z(p) I(t) (13) 
(Fig. 14), where, respectively, Z(p) = R, Z(p) = Lp, Z(p) = 1/Cp. 
The function Z(p) is called the impedance of the two-terminal element ab 


in operator form, or the operational impedance. For the capacitance it is 
not a polynomial, but a rational function 1/Cp: 


U(t) = os (0). (14) 


The relation (14) after multiplication by Cp acquires the usual form 
IQ) = CpU(, which is merely a polynomial in p. If we set 


then relation (13) takes the form 
IQ) = G(p)U(. 


The function G(p) is called the admittance of two-terminal element ab in 
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Fig. 15. Standard symbols of certain elements of electrical networks. 


operator form and has the corresponding form: 


1 1 
Gp) =p = GQ) = Tp’ G(p) = Cp. 
The relations (10) and (11) in operational notation have the form 


Ui(t) = Inply(t) + Mpl2(d), 
Uo(t) = Lopla(t) + Mply(t). 


We shall proceed to the analysis of some examples. As a visual aid, elec- 
trical networks are represented graphically by a point for every junction; 
and for every two-terminal element, a straight line segment or a curve for 
the connection to the corresponding junctions; on every such segment 
the corresponding two-terminal element is represented by the conventional 
designation (lig. 15). 


EXAMPLES 


1. (Oscillatory loop.) Let S be an electrical network with four junc- 
tions a, b, c, and d consisting of four two-terminal elements ab, be, cd, and 
da (Fig. 16). The element ab is the inductance L, bc is the resistance PR, cd 
is the capacitance C, and, finally, the element ad is a voltage source 
Usa(t) = U(é). For the calculations we shall use the method of loop 
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currents. By applying Kirchhoff’s first law to junction b, we obtain 
Ia(t) + Ia(t) = 0, or Ia(t) = I(t). This is always the case when 
exactly two two-terminal elements are connected to one junction. Thus 


we have 
Tap(t) = TIoc(t) = Tea(t) = Iaa(t) = I(t). 


Here I(é) is the loop current. Further, by writing for every two-terminal 
element the law governing its performance, we obtain 


Vas(t) = Lpl(t), U;-(t) = RI), 


Veal) = G10, Uaalt) = — UO. (5) 
Kirchhoff’s second law gives 
Uao(t) + Use(t) + Ucalt) + Uaalt) = 0. (16) 
From (15) and (16) we obtain 
1 
L R+——-)I1@0 = UM. 17 
(p+ k+ aq) 10 = UO (17) 


Both sides of (17) can be multiplied by p (which involves differentiation 
term by term), whence we obtain 


(t* + Rp + *) I(t) = put. (18) 


This is the differential equation of the network undcr examination. 
If the two-terminal element ad is removed from the network, we shall 
obtain a so-called open circuit, consisting of three passive two-terminal 
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elements ab, bc, cd. This network (as a whole) can be considered as a 
two-terminal element with poles a and d (lig. 17). The law governing 
the performance of such a two-terminal element is given by relation (17), 
which is analogous to (13). Here the function Z(p) = Lp + R+ 1/Cp 
is an impedance in operator form and its inverse, 


_ Cp 
G(p) = LCp?2 + RCp + 1’ 


is an admittance in operator form. 

Tf we set U(t) = 0, then this will be equivalent to the assumption that 
in our network the active two-terminal element ad is absent, and the 
network consists of three passive two-terminal elements ab, bc, cd, where 
the junctions a and d coincide (Fig. 18). The equation describing the 
performance of this passive electrical network S* has the form 


(a + Rpt+ ) I(t) = 0. (19) 


As previously noted, electrical phenomena do not arise by themselves in 
a passive electrical circuit, and this is reflected by the fact that the particu- 
lar solution of equation (19) is the function J(¢) = 0. It is possible, how- 
ever, to study the performance of the electrical network S* by first assum- 
ing that it already has a current, and then determining how this current 
will vary with time. Let \, and A, be roots of the polynomial 


Lp? + Rp + a (20) 


Since the numbers L, FR, C are positive {see (A)], the real parts of the roots 
dz and Ag are negative, so that the electrical process in the network S* will 
be damped with time [see §9, (A)]. This damping can occur, however, in 
various ways: if \, and \» are complex, then every nonzero solution of (19) 
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has an oscillatory character (see Example 3, §7); however, if 4; and Az are 
real, then the damping occurs aperiodically, that is, any solution of (19), 
starting at a certain instant, becomes monotonic. The question of whether 
the roots 4; and Ag are complex or real is determined by the sign of the 


number 
R\? 1 
A= (#) ~ Ie’ 


if A < 0, then the solutions of (19) are oscillatory; if A > 0, they are 
aperiodic. 

Of particular interest is the oscillatory loop S* where the resistance F is 
completely absent. In this case our circuit consists only of two passive 
elements ab and cd, and b = c, a = d (Fig. 19). Under this assumption 
the equation of the electrical network has the form 


1 
G + is) I(t) = 0. 


The general solution of this equation may be written in the form 
I(t) = 8 cos (wit + By), 


where w,; = 1/\/LC. Thus, in the absence of resistance in a passive 
oscillatory circuit, there occur nondamping oscillations with the frequency 


VLC 
In the general case the value 1/./LC is called the natural frequency of the 
oscillatory loop S. 


We shall now return to the study of the oscillatory loop S and examine 
the case of a harmonic voltage source U(t). 
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Since the roots of polynomial (20) have negative real parts, it is possible 
to examine the steady-state process in the network 8. We shall seek a 
solution by the method of complex amplitudes (see §12). Let U(t) = rei! 
be the complex harmonic oscillation with real amplitude r > 0. Then the 
right-hand side of equation (18) has the form 


pU(t) = p(re’) = itrwe**. 
We have 
T steady state (t) = oe! 


where the complex amplitude o of the current I steady state (4) is defined by 
the formula 
rw r 


° = TRo + (—Lw? + 1/0)” R+i (Le — 1/Ca) 


[see §12, (A)]. Hence for the real amplitude we obtain 


r 


VR? + (ha — 10a)? 


s = lol = 


From this formula it is clear that for a given amplitude r of the voltage 
source, the current amplitude s is maximum at the natural frequency 
w = w, = 1/\/LC of the loop S. For this frequency the amplitudes s and 
r are connected by the relation s = r/R, i.e., at this frequency the loop 
behaves as though only resistance were present. For the remaining fre- 
quencies the current amplitude s has a value smaller than r/R. This 
phenomenon is called resonance (compare the example in §12). The oscil- 
latory loop L, R, C resonates at its own natural frequency 1/./LC. 

2. (Transformer.) A transformer consists of two windings, primary 
and secondary, placed on a single core. A source of variable voltage is 
connected to the primary winding, and to the secondary winding a load, 
for example, external resistance. Each winding has inductance and 
resistance (internal). Between the windings there exists a mutual induc- 
tion. Thus the transformer may be considered an electrical network 
consisting of two separate loops inductively connected. The first loop 
consists of three two-terminal elements: a,b; is the inductance Ly, b,c, is 
the internal resistance R,, and a,c; is the voltage source U,,,. = U(t). 
The second loop also consists of three two-terminal elements: agbz is the 
inductance Le, becs is the internal resistance Rz, and code is the load re- 
sistance &. In addition, there exists a mutual induction M,,s,,c., = M 
(Fig. 20). By Kirchhoff’s first law we have 


Ta,b, = Doxey = Tea, = th; Tand. = Tbgcg = Lega, = Io. 
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Thus we have two loop currents, I,, Iz. By applying Kirchhoff’s second 
law we obtain 


Iypl, + MpI, + Ril, — U(t) = 0, 
Lopl, + Mpl, + RoI, + RI, = 0, 


or 
(Inp + Ri)I, + Mpl, = Ute). 
Mpl, + (Lop + Ro + R)I2 = 0. 


The determinant D(p) of this system has the form 
D(p) = (InL2 — M?)p? + (1 Re + LiR + LeRi)p + Ril(Re + R). 


By virtue of proposition (B) of §9 this polynomial is stable since L,L_. — 
M? > 0. We shall examine the performance of a transformer in the case 
where the voltage U(é) varies harmonically, and we shall seek the steady- 
state solution by the method of §12, (B). Let us set 


U(t) = uses, 


where uw, is a positive real number (the amplitude of the voltage applied 
to the primary winding). We shall seek the unknown functions J, and I, 
in the form 


I, = aye, Io = ove 


it 

? 
where o, and a2 are the complex amplitudes of the currents. 

The so-called ideal transformer, i.e., a transformer in which the values 
of Ry, Ry and L,L, — M® are negligible, is of the greatest theoretical 
interest. By not using these values in the equations for the values 0, and 
@2, we obtain 

Ly - 10001 + M - iwo, = U1, 


M - too, + (Le: iw+ R)oe = 0. 


Since M ~ \/L,Io, by subtracting from the second equation the first 
equation multiplied by —/L2/L,, we obtain 


Thus the amplitude u. = Rio] of the voltage drop across the load re- 


sistance R is 
ta = 2 
2> 7 U1; 
Ly’ 
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Figure 21 


the value ~/L2/L, is called the transformer ratio. Thus if Lz > Ly, we 
have a transformer which increases the voltage, 


Us 


> 1; 
U1 


if Lz < Ly, we have a transformer which decreases the voltage, 
Bel, 
U1 


3. (Electrical filter.) We shall study an electrical network with four 
junctions a, b, c, d and five two-terminal elements (I*ig. 21), where 
ab is the inductance L, 
bc is the inductance of the same quantity L, 
bd is the capacitance C, 
ad is the voltage source Uag(t) = U(d), 
cd is the load resistance R. 


Let us set 
In = th, Tye = Ip. 


Then by Kirchhoft’s first law we have 
Tha = I, — To, Tig = Ip. 
By Kirchhoff’s second law we have 


Ua + Use + Uca + Uda = 0, 
Use + Uca + Uay = 90, 
or 
LpI, + LpIn + RIz — UW) = 9, 


1 
IpI,+ Rl2 + (2 — I;) = 0. 
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Multiplying the second equation by p, we obtain the following system: 
Lpl, + (Lp + R)I2 = U(t), — 41+ (w+ ep +Z)hn=0 
The determinant of this system has the form 

D(p) = L*p* + LRp? + 72P 4 2. 


By Theorem 6, the polynomial D(p) is stable. We shall now assume that 
U = be**', where b is the real amplitude of the voltage (see §12). 
We shall seek unknown functions J, and J, in the form 


—_— iwt —_ tut 
I, = aye, I, = age’, 


where a, and dg are the complex amplitudes of the currents, i.e., we re- 
strict ourselves to finding a steady-state process. 
Determining the voltage drop U.g = RI, across the load, we have 


_ b/C 
02 = (R/C — LRw®) + iw 2L/C — L2w2)’ 


from which we determine the amplitude a = |a,|R of the voltage Ua: 


a = |aQ|R = 


V(R/C — LRw?)? + w? 2L/C — Tiq2)3 


For small values of the frequency w we have a/b ~ 1; in other words, 
voltages of small frequency are easily transmitted through a filter with 
almost no change of amplitude. For large values of the frequency we have 


a/b = R/CL2u3, 


so that high-frequency voltages almost fail to pass through, i.e., they are 
“filtered.” 


14. The normal linear homogeneous system with constant coefficients. 
Here the system of equations 


nr 
# = DY aje’, a=1,...,n, (1) 
jal 


with constant coefficients is solved. This system can be solved by the 
method of elimination (see §11, particularly Example 2). Here it is solved 
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by reduction of the matrix A = (a) to the Jordan form. In the case 
when all eigenvalues of A are distinct, the task of reducing it to Jordan 
form, i.e., to diagonal form, is quite elementary. In the general case, 
however, the problem of reducing A to the Jordan form is one of the most 
complicated in linear algebra. In what follows we will use only those 
results of the present section which are based on the reduction of A to a 
diagonal form in the case of distinct eigenvalues. Results involving the 
reduction of A to the Jordan form in the general case will not be applied 
later. They are presented in propositions (C), (D), and (G) and in Theo- 
rem 28, but these propositions and Theorem 28 may be skipped without 
impairing our comprehension of subsequent material. 

Usually the reduction of A to the Jordan form for a solution of system 
(1) is effected by means of a linear transformation of the indeterminates 
x!,..., 2%. This method is presented at the end of the present section 
under the title “Transformation of variables.” A second method, also 
based on the reduction of A to the Jordan form, is presented in the first 
part of this section. 

In this section we shall make no distinction between a matrix A and the 


corresponding transformation A in the vector space x = (z!,..., 2”), 
because the basis will not change. The only exception is the proof of propo- 
sition (T°). 


The case of simple roots of the characteristic equation. (A) The system of 
differential equations (1) may be rewritten in vector form 


k= Ax. (2) 


Here A = (ai), and in place of the system of indeterminates x',..., 2”, 
we introduce the indeterminate vector 


x= (z},...,2%); 


by the derivative x of x we mean the vector (z!,..., 4"). If his an eigen- 
vector of A with eigenvalues 4, i.e., if 


Ah = db 
[see §32, (B)], then the vector function x defined by the relation 


x = he, 
is a solution of (2). 
This last proposition may be verified by substituting x = he™ into (2). 


THEOREM 10. Let 
x= Ax (3) 


be a system of differential equations [see (A)] such that the eigenvalues 
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Ai, ---, An Of the matrix A are mutually distinct, and let 
h,,...,h, 
be the corresponding eigenvectors of this matrix. If we set 
x; = he, a= 1,...,%, (4) 
then the vector function 
x = clxy +--+ + cE, (5) 


where c!, ..., c” are constants, is a solution of equation (3), and any 
solution of equation (3) is given by (5). 


Proof. By proposition (A) every function x; is a solution of (3), and there- 
fore, by proposition (A) of §6, formula (5) always gives a solution of 
equation (3). We shall now prove that every solution of equation (3) can 
be written in the form (5). Let g() be an arbitrary solution of equation (8). 
By Theorem 3 the solution g(t) can be assumed to be defined on the entire 
line —co <t < ow. Thus this solution is also defined at ¢ = 0. Let us 


set (0) = Xo. Let 
Xo = chy +----+ c*h, 


be an expansion of the vector xX, in terms of the basis vectors hy, ..., hy; 
the vectors h,,..., h, form a basis by virtue of proposition (C) of §32. 
Then the solution x, defined by formula (5), evidently satisfies the initial 


conditions 
x(0) = Xo. 


The same initial conditions ¢(0) = Xo are also satisfied by the solution 
y(t); thus, by the uniqueness theorem (see Theorem 2), x = g(t), so that 
Theorem 10 is proved. 

In the case that the matrix (a}) defining equations (3) is real, we have 
the problem of separating the real solutions from all the solutions (5). 

(B) We shall assume that the matrix (a}) defining equation (3) is real, 
and we shall select vectors h;, ..., h, in such a way that real vectors 
correspond to real eigenvalues and complex conjugate vectors to complex 
conjugate eigenvalues. Then in the system of solutions (4) to each pair of 
eigenvalues will correspond a real solution, and to every two complex 
conjugate eigenvalues will correspond complex conjugate solutions. There- 
fore, the solution (5) is real if and only if the constants attached to real 
solutions are real and the constants corresponding to complex conjugate 
solutions are conjugate. 

The validity of proposition (B) follows directly from proposition (D) 
of §7. 
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The general case. We turn now to the solution of the system (1) in the 
general case, where the matrix (a}) may have multiple eigenvalues. The 
analysis of this case rests upon an important and difficult-to-prove algebraic 
theorem concerning the reduction of a matrix to the Jordan form (see §34). 

(C) Let us write system (1) in vector form 


x= Ax (6) 
and let 
hy,..., be 


be a certain basis set with eigenvalue \ [see §34, (A)] with respect to the 
matrix A, so that the relations 


Ah, = Ah, Abhyg = Aho + h,, sey Ah, = dh; + h;_1 


are satisfied. Let us introduce a sequence of vector functions, assuming that 


r—1 2 


t 
of = Goat ea ayy bet +b, r=1,...,k& (7) 


We then find that the vector functions 
X, = a,(t)e™, r=1l,...,k, (8) 
are solutions of equation (6), and in addition 
x,(0) = hy. (9) 


Thus to every basis set of k vectors corresponds a system of k solutions. 
To prove that the vector functions (8) are solutions of (6), we shall state 
the following two identities concerning the vector functions (7): 


6-(t) = w,_1(8), r=1,...,k; 
Aw,(t) = dAw,-() + w,_1(d), r=1,...,k 
In these relations wo(t) is taken to equal 0. Both may be directly verified 


by elementary calculations. With the aid of these identities, it is imme- 
diately seen that the functions (8) are solutions of (6). Actually, we have 


&- (de + rw, (de™ = (w,_1(t) + dw (t))e™ 
= Aa,(t)e’ = Ax,(t). 


X,(2) 


We shall now turn to the formulation and proof of a theorem which 
gives the solution of (1) in the general case. 
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THEOREM 11. Let 
x= Ax (10) 


be the vector form of the system (1). By Theorem 28 (see §34) there 
exists a basis hy,..., h,, consisting of basis sets relative to the matrix 
A. Specifically, we shall assume that hy,..., hy,, isa basis set with the 
eigenvalue Aj, that hy,41,..., He,4x%, is a basis set with the eigenvalue 
de, etc. By proposition (C), to each basis set corresponds a system of 
solutions, so that we can write the following solutions of equation (10): 
fant 


x; = hye! 16. Xk = — h, +.--- + bs.) et 
det 


Xe,41 = hy, +1€ , (11) 


‘ pk2-1 
Xk, tke = — he,41 terre bn, +8) ent etc. 
Hence, the formula 

x= clx,;+---+ cK, (12) 


where c!,..., c” are constants, always gives a solution of equation (1), 
and every solution of (10) may be written in terms of (12). 


Thus the functions x,, ..., Xn, constitute the so-called fundamental 
system of solutions of equation (10). 

Proof. Since the functions x;, ..., X, are solutions of (10) [see (C)], 
then, by proposition (A) of §6, formula (12) always gives a solution of (10). 
We shall show that every solution of (10) may, by suitable choice of con- 
stants c!, ..., ¢”, be written in form (12). Let g(t) be an arbitrary solu- 
tion of (10). By Theorem 3, the solution g(t) can be assumed to be defined 
on the entire line —co < t < o, so that the vector o(0) = Xp is defined. 
Let us expand this vector in terms of the basis hy, ..., hy: 


Xo = eth, + se + c"h,. 


If the constants c!, ..., e” determined are now substituted into (12), we 
obtain a solution x(é), which satisfies the initial conditions 


x(0) = c!x,(0) + +++ + e%xn(0) = clhy +--+ + c%hy = Xo 


[see (9)]. Thus the solutions g(é) and x(é) have common initial values and 
therefore coincide. Theorem 11 is thus proved. 

It now remains to separate from the solutions given by (12) the real 
solutions in the case when the matrix (aj) is real. This is done in exactly 
the same manner as in the case of simple roots of the characteristic equa- 
tion. 
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(D) Let us assume that the matrix (a}) which defines equation (10) is 
real. In this case we choose the basis hy, ..., bh, by the method specified 
in Theorem 28 (see §34) for the case that the matrix (a;) is real. With 
this choice of basis, there will be on the one hand, among the solutions (11) 
constructed in Theorem 11, real solutions and, on the other hand, pairs 
of complex conjugate solutions. Thus the solution (12) is real if and only 
if the constants corresponding to real solutions are real and the constants 
corresponding to complex conjugate solutions are complex conjugates. 
Proposition (D) now follows directly from (D) of §7. 

In conclusion it should be noted that the above results of the present 
section are closely connected with the results of §7 and §8, where we 
studied one mth-order homogeneous equation with constant coefficients. 
By the method of §4 such an equation can be written in the form of a 
normal system of n equations. Thus the results of §7 and §8 can be derived 
from the results of the present section. In addition it is found that the 
characteristic polynomial of the normal system obtained coincides with 
the characteristic polynomial of the initial equation. 

Transformation of variables. (E) Let us replace the indeterminates 


zt,..., 2” (13) 


in the system (1), which is defined by the constant matrix A = (aj), by 
new indeterminates 

Yee y”, (14) 
by setting 


y = YS siz’, f=1,...,%, (15) 
i=l 


where s/ are constant coefficients with a nonsingular matrix S = (s’). 
In terms of the new indeterminates our system may be written in the form 


y= Dd diy’, t= 1,...,7, (16) 
where the matrix B = (b') is obtained from A by the formula 
B= SAS™}. (17) 


We shall now prove this. Differentiating relation (15) with respect to ?, 
we obtain 


y= dS sie’, j=l,...,n. (18) 
i=l 


Thus, the components of the vector = (z!,..., 4") are transformed in 
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the same way as the components of the vectorx = (z!,..., 2”). Formula 
(17) follows directly from this transformation [see §32, (A)]. We shall 
carry out anew, however, the proof given in §32, (A). The relations (15) 
and (18) may be rewritten in the matrix form 


y=Sx, y= Si, 


so that we have 
y = Sk = SAx = SA-S7y, 


which proves formula (17). 

By a proper choice of the matrix S we can obtain the simplest form of B. 
Since the transformation (17) of the matrix A into the matrix B is effected 
by means of the matrix S, we can attain the Jordan form of the matrix B 
[see §34, (B)]. 

(I) If each eigenvalue 

ee 


of the matrix A of the system (1) is distinct, then the linear transforma- 
tion (15) can be chosen in such a way that (16) takes the form 


y* = ry", k=1,...,n. (19) 


If, in addition, the matrix A is real so that together with every complex 
eigenvalue A; in the sequence Aj, ..., A» there appears the complex conju- 
gate A; = Ax, and if the variables (13) are real, then the transformation (15) 
can be chosen so that to every real eigenvalue \; corresponds a real variable 
y’, and to the pair of complex conjugate eigenvalues A; and \; correspond 
the complex conjugate variables y" and y' = 7". Thus to the conjugate 
eigenvalues correspond the conjugate equations 


yr = ny”, Thi = xy. (20) 


Let 
Ne = Mk + tx, yk = B+ an, (21) 


where px, Vz, £*, n* are real numbers. Then the pair of conjugate equa- 
tions (20) can be replaced by a pair of real equations 


EF = ppt! — ven*, te = Vet” + pen’. (22) 


Carrying out a similar substitution for every pair of complex conjugate 
eigenvalues, we shall be able to substitute for the system of real variables 
(18) a new system of real variables, whose equation has partly the form 
(19) (for real \;) and partly the form (22) (for pairs of complex conjugate 
eigenvalues). 
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To prove proposition (I*) in the space R of vectors x = (z!,..., 2”), 
we make the matrix A correspond with the linear transformation A (see 
§32), and we denote by hy, the eigenvector of the transformation A with 
eigenvalue \,. Asa basis in & we shall now take the vectors 


hi,..., br, (23) 


and we denote the coordinates of the vector x corresponding to this basis 
by y},..., y”. Thus we shall obtain the linear transformation (15). In 
the new system of coordinates the matrix B corresponds to the transforma- 
tion A. Finally, since the matrix B has obviously a diagonal form with 
the numbers )j,..., A, in the diagonal, the system (16) takes the form (19). 
Now if the matrix A is real, then to each real eigenvalue dA, will corre- 
spond the real vector h,, and to a pair of complex conjugate eigenvalues 
dz and A; = Ag, will correspond a pair of complex conjugate eigenvectors 
bh, and h; = by. An arbitrary vector x in the new coordinate system 
may be written in the form 
nr 
x= >> y’hj. (24) 


j=1 


If it is real then the coefficients of real vectors must be real and the 
coefficients of complex conjugate vectors must be complex conjugate 
[see §7, (D)]. Thus a real variable y’ corresponds to every real eigenvalue 
\;, and the complex conjugate values y* and y’ = y* correspond to the 
pair of complex conjugate eigenvalues A; and A, = Xx. 

We shall write equation (20), after substituting in it the values A, and 
y* from (21), for the transition from a pair of complex conjugate equa- 
tions (20) to a pair of real equations (22). We obtain 


+ in® = (ux + ive) (E® + in*) = wnt ® — van® + iet* + wen"). 


By equating separately the real and imaginary parts of this relation, we 
obtain the system of equations (22), and proposition (I°) is proved. 
The system of equations (19) has the obvious solution 


y* = ce, k= 1,...,%, 
but in order to obtain a solution of the original system (3) it is necessary 
to go from the indeterminates (14) to the indeterminates (13), and for this 
transition we must know the eigenvectors (23) of the matrix A [see (24)]. 
Thus proposition (I°) is equivalent to Theorem 10. 

For the solution of (1) in the general case, the reduction of the matrix A 
to the Jordan form can be used. The proposition (G) for this case is 
equivalent to Theorem 11. 
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(G) Let the matrix A of the system (1) be arbitrary. We shall select 
the transformation (15) in such a way that the matrix B has the Jordan 
form (see §34). Let \ be one of the eigenvalues of matrix A and let k be 
the dimension of one of the Jordan cells of B with eigenvalue X. We shall 
assume that this cell occupies the first & rows. The system of equations 
corresponding to this cell has the form 


To every other Jordan cell of B corresponds a similar system of equations 
which is easy to solve. 


EXAMPLES 


1. The application of the method presented in this section to the solu- 
tion of system (1) requires the determination of the basis hy,..., bh, of 
the vector space which consists of basis sets (see Theorem 11). This 
determination itself presents a certain algebraic problem. By using the 
results of this section, we shall show how the system (1) can be solved by 
the method of undetermined coefficients without finding the basis which is 
composed of basis sets. Let \ be a certain eigenvalue of the matrix (aj). 
To this eigenvalue, generally speaking, correspond several basis sets con- 
tained in the basis hy,...,h,; let k be the longest of the basis sets cor- 
responding to the eigenvalue 4. By Theorem 11, each of the solutions 
corresponding to the eigenvalue \ can be written in the form 


v=fide’, t=1,...,n, (25) 


where f‘(t) is a polynomial of degree < k — 1. Thus, substituting into 
system (1) a solution in the form (25) and assuming that the coefficients 
of the polynomials f‘(£), i = 1, ..., n, are unknown constants, we can, 
by using the method of undetermined coefficients, find all solutions of (1) 
corresponding to the eigenvalue \. In order to solve the system (1) by this 
method it is not necessary to know the basis sets corresponding to the 
eigenvalue A; it is necessary to know only the lengths of these basis sets. 
Determination of the lengths is a simpler algebraic problem than reduction 
to the Jordan form; it is solved by the theory of elementary divisors of 
matrices, which pertains to linear algebra. The theory of elementary 


divisors is not used in this book. 
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2. We shall now show how to solve system (1) by the method of elimina- 
tion presented in §11. To apply the method of elimination, we shall write 
(1) in the form 


», Li(p)x’ = 0, 


j=l 
where 
Li(p) = aj — p 8}. 


The determinant D(p) of the matrix (Li(p)) in this case is a characteristic 
polynomial of the matrix (a}). Let \ be a certain root of D(p) or, what is 
the same thing, an eigenvalue of the matrix (a'). The multiplicity of \ 
will be denoted here by 2. By proposition (C) of §11 every solution of (1) 
corresponding to the root d is to be found in the form 


x’ = gi(te, i= 1,...,n, 


where the degree of the polynomial g’(t) does not exceed 1 — 1. If we 
compare the method presented in this example with that in Example 1, 
we see that the whole difference lies in the determination of the maximal 
degree of the polynomials. The method of Example 1 gives a more accurate 
determination of the degree of the polynomials, since the number k, 
generally speaking, is smaller than the number J. Indeed, the sum of all 
lengths of basis sets corresponding to d is equal to 1. Thus the equality 
k = I can hold only when there is only one basis set corresponding to the 
eigenvalue A. 


15. Autonomous systems of differential equations and their phase 
spaces. We shall give here a geometrical interpretation of an autonomous 
system of equations in the form of the phase space of this system. This 
interpretation differs essentially from the geometrical interpretation of 
the system of equations in §] and should, therefore, more correctly be called 
a kinematic interpretation, since to every solution of its system of equations 
corresponds not a curve in a space, but the motion of a point along the 
curve. The kinematic interpretation (the phase space) is in certain respects 
more expressive than the geometrical (the system of integral curves). 

Autonomous systems. A system of ordinary differential equations is 
called autonomous if it does not explicitly contain the independent variable 
é (or, as we shall call it, the tme). This means that the law of variation 
of the unknown functions which are described by the system of equations 
does not change with time, as is usually the case with physical laws. It is 
very easy to prove that if 


x’ = gi(t), @#=1,...,%2, 
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is a solution of a certain autonomous system of equations, then 


v=o) =e(tteo, it=1,...,n, 


where c is a constant, is also a solution of the same autonomous system of 
equations. We shall carry out the proof of this fact by an example of a 
normal autonomous system of equations. 
(A) Let ; ; 
t= fi(xt,..., 2”), t=1,...,%, (1) 


be an autonomous normal system of nth-order equations and 
x = f(x) 


its vector notation. The autonomy of system (1) consists of the fact that 
the functions f‘(z!,...,2"),i = 1, ..., ”, are functions of the variables 
z!,...,a" and do not depend on the time t. We shall assume that the 
functions f*(x1, ..., 2”) are defined in a certain domain A of the n-dimen- 
sional space where x',..., x” are the coordinates of a point. We shall 
assume further that f*(x!,..., x") and their first-order partial derivatives 


are continuous in the domain A. Thus, if 
v=), “i=1,...,n, (2) 
is a solution of (1), then 


x = ¢x(E) = v(t + ¢), t= 1,...,%, (3) 


is also a solution of (1). It is evident that if the solution (2) has as a 
maximal interval of existence the interval m,; < ¢ < mg, then the solu- 
tion (3) has the maximal interval 


m,—e<t<m2—e. 


T'rom the differentiation formula for a composite function, we have the 


relation ; 
ex(t) = g(t +0), a= 1,...,n. (4) 
Indeed, 


ei d ; d ; d é 
eH) = Ger) = Geto = Gap eb tO ay 


= ¢@'(t+e-l1=¢(t+o). 


We shall now prove that (3) is a solution of the system (1). Since (2) is 
a solution, we have the identities 


HH) =f(el'O,...,e°O), t= 1,...,0. 
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Replacing ¢ in these identities by t + ¢ we obtain 
gt+to=fi(eltt+o),...,e%t+0), a= 1,...,”. 


Combining this with (4) and (3), we have 


ee) =e t+oO=f(e't+o,..., ett) = sek, -.-, ck(O). 


We shall now turn to the kinematic interpretation of the solutions of 
system (1). Formally we shall speak about an interpretation in n-dimen- 
sional space, but for the sake of clarity it is reasonable to imagine the case 
of a plane (n = 2). 

(B) To every solution 


viz gif), i= 1,...,n, (5) 


of the autonomous system (1) we make correspond the motion of a point 
in n-dimensional space defined by equations (5), where x1, ..., x” are the 
coordinates of the point in space and ¢ is the time. In the course of its 
motion the point describes a curve known as the trajectory of the motion. 
If we associate with the solution (5) not the process of motion, but the 
trajectory of the motion of the point, then we shall obtain a less complete 
picture of the solution, since it is also desirable to indicate the direction 
of the motion on the trajectory. Thus, if there is another solution 


= v(t), t= 1, reg Ny (6) 


in addition to (5), then the trajectories corresponding to these solutions 
either do not intersect in the space or else they coincide. That is, if the 
trajectories have even one common point, i.e., 


g(t) = ¥'(te), t= 1, rey K, (7) 
then 
V(t) = ¢'(t+ec), where c= t, — fo. (8) 


These last equalities show that the trajectories described by the first and 
second solutions coincide, but the first solution describes the same trajec- 
tory as the second with the time “delay” c. If the point corresponding to 
the first solution has reached a certain position on the trajectory at instant 
t + ¢, then the point corresponding to the second solution has already been 
in this position at the instant ¢. . 

Tn order to derive (8) from (7), we shall examine the solution 


ex(t) = g(t +e) (9) 
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along with (5) [see (A)]. Equations (7), force = t; — te, yields the equality 


gk(t) = e'(te +0) = e'(ti) = V'(te), t= 1,...,7” 


Thus, the solutions (6) and (9) of the system (1) have common initial 
conditions (namely, their values at the instant t,), and therefore by the 
uniqueness theorem they must coincide, so that we have 


VOQ=e80=¢'(t(+o, t=1,...,n 


States of equilibrium and closed trajectories. We pose the question of 
whether a trajectory representing a solution of the system can intersect 
itself. 

(C) Let 

v=y(t), “i=l,...,n, (10) 


be a certain solution of the system (1) defined on a maximal interval 
m, < t < ma. We shall assume that the equalities 


g'(ti) = ote), t=1,...,n, th # te, (11) 


are valid, where t; and ¢e, of course, belong to the interval my < t < mp. 
It then turns out that my = — 0, m2 = +. [i.e., the maximum interval 
of existence for the solution (10) is the entire line] and that the following 
two mutually exclusive cases are possible. 

1. Yor all values of ¢ the equality 


g(t) = a’, a= 1,...,%7, 


is valid, where (a!, a?,..., a”) is a point of the domain A which does not 
depend on ¢. Thus in this case the point (¢'(t),..., ¢"(£)) actually does 
not move as ¢ varies but remains fixed. In this case the solution (10) 
itself with the point (a!,..., a”) is called a state of equilibrium of the 
system (1). 

2. There exists a positive number 7’ such that for arbitrary ¢, the 
equalities 

e(i+ T) = ¢'(d), a= 1,...,%, 


are valid, but for |r; — 7.| < T and for at least one i = 1,..., 7, the 
inequality 
e'(71) # ¢'(72) 


is valid. In this case the solution (10) is called periodic with period 7, and 
the trajectory described by (10) is called a closed trajectory or a cycle. 
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Tirst of all, we shall show that the maximal interval of existence of 
solution (10) is the entire straight line. As was noted in proposition (B), 
the identities 


g(t +c) = ¢(0), az=1,...,%”, c= ti — fe, (12) 


follow from equality (11). Since by this equality the interval my — c < 
t < mz — ¢ coincides with the interval m, < t < me, we have m, = 
—Bf,mMo = +a . 

Every number c for which (12) is satisfied will be called a period of the 
solution (10); the set of all periods of the solution (1) is designated by F, 
which is a certain set of numbers. We shall establish some of its properties. 
Substituting 4 — c for ¢ in (12), we obtain ¢'(t) = g(t — c). Thus, if c 
is a period, then —c is also a period. Let us assume that c, and ce are 
periods, i.e., that 


v(t +e) = ¢'(t), v(t +¢2) = ¢'(t); t= 1,...,7. 
Then 
e((ttte+ta)=ett+e) =e), i=1,...,2 


Thus if c; and cz are periods, then c, + cz is also a period. Let us assume 
that ¢, Co,..., €m,...is a sequence of periods which converges to a 
certain number cq; then we have 


e(ttemn) = ¢(t); t=1,...,n, m=1,2,.... 
Since the functions ¢'(£) are continuous, then for m — o we have 
g(t + co) = ¢'(b), 


i.e., we see that cg is also a period, because the set F is closed. 

Since the number c in (12) is distinct from zero (t; ~ ty), the set F con- 
tains numbers distinct from zero. From the properties of the set F which 
have been established it is easily seen that there exist only two possibilities: 
(1) the set F coincides with the set of all real numbers; (2) in the set F 
there is a minimal positive number 7, such that F consists of all integer 
multiples of the number 7. Let us prove that there are actually only these 
two possibilities. Since the set F contains the number —e whenever it 
contains the number c, and since in F there are numbers different from 
zero, there are positive numbers in F’. 

Let us assume that there is no least positive number in F, i.e., that for 
every positive number e there is a positive period c < ¢. From the above 
properties of the set F it follows, since ¢ is a period, that all numbers mc, 
where m is an integer, are also periods. Since c < e, then for an arbitrary 
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real number €p it is possible to find an integer m such that |co — me| < e. 
Thus an arbitrary number co is a limit point of the set F, and therefore, 
in view of the fact that set F is closed, this set coincides with the set of all 
real numbers. 

Let us now assume that F is not the set of all real numbers. By what 
has been proved, there then exists in F a least positive number 7’. Let c be 
an arbitrary period. Then it is possible to select an integer m such that 
lc — mT| < T. Let us assume that ec ~ mT; then |c — mT| is a period 
distinct from zero; but this is impossible since |c — mT| < T, which 
contradicts the minimal character of the number 7’. It is thus proved that 
every number c from F can be written in the form c = mT, where m is 
an integer. 

Now it is easy to verify that, if F is the set of all real numbers, then 
case (1) occurs, and if F is not the set of all real numbers, then case (2) 
occurs. Thus proposition (C) is proved. 

Proposition (C) can be formulated briefly by saying that there exist 
three kinds of trajectories: (1) those of the state of equilibrium; (2) periodic 
trajectories (cycles); and (3) nonintersecting trajectories. It is natural to 
take the last case as the “most general.” 

Irom Theorem 2 it follows that a trajectory representing a solution of 
the system passes through every point of the domain of definition of the sys- 
tem (1). Thus, the entire domain A is filled with trajectories, and in accord- 
ance with (B) these trajectories do not intersect each other in pairs. Those 
trajectories which do not intersect are of particular interest; they represent 
either states of equilibrium or cycles, and are quite important. 

This is the kinematic interpretation of solutions of an autonomous 
system of equations. The system of equations itself also admits a geo- 
metric interpretation. 

Phase spaces. (D) Since the autonomous system of equations (1) is 
defined in the domain A, each point (x9, ..., 2%) of the domain A corre- 
sponds to a sequence of m numbers, namely the sequence 


f'(eo, «++, 20), ++ + F*(@Oy «+ - » #0). 


These numbers can be thought of as components of a vector f(x9, .. . , 76) 
in an n-dimensional space emanating from the point (2g, ... , #2). Thus the 
autonomous system gives rise to a geometric picture, a vector field defined 
in the domain A. The vector f(xj,..., 2%) is defined at every point 
(xj, ..., #8) of A, starting from this point. The connection between the 
geometrical interpretation of the solutions and the geometrical interpreta- 
tion of the system of equations itself is given by the following. Let 
(2g, ..., £8) be an arbitrary point of A. In the geometrical interpretation 
of the system of equations the vector f(x, ..., 2%) corresponds to the 
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point from which it starts. Further, by Theorem 2 there exists a solution 
x’ = ¢'(t) of (1) which satisfies the initial conditions 


¢' (to) => XO, t= 1, wee Kt 


According to the kinematic interpretation, the solution z* = ¢'(é) cor- 
responds in the space to the motion of a point which describes a trajectory 
which, at the instant ¢ = fo, passes through the point (x4, ..., 2%) in the 
space. Thus the vector velocity of the point which describes the solution 
x’ = g(t) at the instant of its passage through the point (x, ..., 2”) 
coincides with the vector f(x3,..., 27). It is just this coincidence which 
is expressed by the system of equations (1) for 


x= Xo, t= 1,...,n; t= lo. 


The n-dimensional space, in which solutions of autonomous system (1) 
are interpreted in the form of trajectories and the autonomous system 
(1) itself in the form of a vector field, is called the phase space of the system 
(1). The trajectories in this space are called the phase trajectories, and the 
vectors f(xp, ... , 2%) are called the phase velocities. The connection be- 
tween the two interpretations consists in the fact that the velocity of the 
motion of a point along a trajectory at each instant coincides with the 
phase velocity given at that point of the space where the moving point is 
located at that instant. 

Let us now examine states of equilibrium from the point of view of phase 
velocities. 

(E) In order that the point (a1, ..., a”) of the domain A be a state of 
equilibrium of the system (1), i.e., that xt = y(t) be a solution of the 
system for which 


g@=a, i= 1,...,n, (13) 
it is necessary and sufficient that the phase velocity f(a',...,a”) at 
(a!,..., a”) be equal to zero. Thus to find all states of equilibrium of (1) 


it is necessary to solve the system of equations 
fial,...,a =0, t=1,...,n. 


This system is not a system of differential equations, but rather a system 
of finite (or algebraic) equations since it does not include derivatives. 

To prove proposition (FE), we shall assume that (a',...,@”) is a state 
of equilibrium, i.e., that there exists a solution x’ = ¢'(t) for which the 
relations (13) are satisfied, and we shall substitute this solution into (1). 
Since the derivative of a constant is zero, the substitution yields 


ig 1 mn @ i _@ to __ 
£@,.-,e) = Ged = Fe = 0. 
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Thus the phase velocity vector f(a!,...,a”) actually vanishes at the 

point (a',...,a”). Let us assume that, conversely, the phase velocity 

vector f(a}, ..., a”) vanishes at the point (a!,..., a”), i.e., that 
fia),...,a) =0, t=1,...,n, 


and we shall show that in this case the equalities (13) determine a solution 
of (1). Substitution gives 


e(t) = fi(a',..., a”), t= 1,...,7; 


these equalities are satisfied, since on the left we have the derivative of a 
constant and on the right, zero. 


EXAMPLES 


1. (Compare Example 3 in §2). We shall study the autonomous first- 
order differential equation 


é = f(z), (14) 


where f(x) and its derivative are continuous over the entire line P where x 
is allowed to vary. We shall assume in addition that the zeros of f(x) or, 
what is the same thing, the states of equilibrium of equation (14) do not 
have limit points. Under this hypothesis the states of equilibrium divide 
the straight line P into a system = of intervals. Each interval (a, b) of = 
has the property that the function f(z) does not vanish on it and that 
each of its endpoints, a or B, is either a zero of the function f(x), or is equal 
to toc. Thus > consists of a finite or countable number of finite intervals 
and not more than two semi-infinite intervals, or consists only of the infinite 
interval (— 00, 00). Let (a, b) be a certain interval of 2, xo a point of this 
interval, z = ¢(t) a solution of equation (14) with initial values 0, x9, and 
my, < t < ms, the maximal interval of existence of the solution g(t). To 
be definite we shall assume that f(z 9) > 0, so that 


a<¢(t) <b for m, <t< ma, (15) 
lim g(t) = a, lim g(f) = Bb. (16) 
tom, trom, 


Further, if either number a or 6 is finite, then the number m, or mz, 
respectively, is infinite. Thus (lig. 22) every interval (a, b) represents one 
unique phase trajectory of equation (14). 

We shall prove relations (15), (16). From the hypothesis that f(z9) > 0 
it follows that the function f(z) is positive on the interval (a, b), and there- 
fore every point of this interval describing a phase trajectory moves from 
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left to right. Thus with increasing ¢ the point ¢(é) can leave the interval 
(a, b) only by passing over the right-hand endpoint b. Let us assume that 
this takes place for some ¢ = ¢,; then at ¢ = t,, we have o(t,) = 5, 
which creates the impossible situation that the two different trajectories 
x = ¢(t) and x = bintersect. In exactly the same way we can prove that 
the point y(é) cannot leave interval (a, b) for decreasing ¢. Thus (15) is 
proved. 

Let us now assume that lim;m, o(é) = ¢ < 5, and let y(t) be a solution 
of (14) with the initial values 0, c. Since f(c) > 0, then for some negative 
value of tg we have (tz) < c, but this means that two different tra- 
jectories y(t) and y¥(é) intersect, which is impossible. Thus it is proved 
that lim,—m, g(t) = b. The relation 


lm = ¢(f) =a 
tm, 
is proved in exactly the same way. 

Let us assume, finally, that b < oo, and show that under this assump- 
tion mz = oo. Let us assume the contrary, that is, mz < oo. Let us then 
define a function x(t) by setting X(t) = g(t) for my < t < me, and 
x(t) = bfort > mg. It is evident that the function X(t) is continuous and 
satisfies equation (14), but this is impossible since two different trajectories 
x(é) and x = b would then intersect. This contradiction shows that 
Mz = +00. In exactly the same way it can be proved that fora > —o 
we have m; = —o. 

Let 6 be an arbitrary state of equilibrium of equation (14) and let 
(a, b) and (8, c) be the two intervals of Z adjoining it (on the left and right, 
respectively). Each of the intervals (a, b) and (6, c) represents one tra- 
jectory. If both points describing the trajectories (a,b) and (b,c) ap- 
proach (with increasing t) the state of equilibrium }b, then the state of 
equilibrium b is called stable (I'ig. 23(a)]. If both points describing the 
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trajectories (a,b) and (b,c) recede from the point b, then the state of 
equilibrium 6 is called unstable [T'ig. 23(b)]. If along one of the trajectories 
the point approaches and along the other it recedes, then the state of 
equilibrium 6 is called semistable [IFig. 23(c)|. In order that a state of 
equilibrium 6 be stable, it is necessary and sufficient that the function f(z) 
be positive on the interval (a, b) and negative on the interval (b,c). For 
state of equilibrium 6 to be unstable, it is necessary and sufficient that the 
function f(z) be negative on the interval (a, b) and positive on the interval 
(b,c). For a state of equilibrium b to be semistable, it is necessary and 
sufficient that the function f(z) have the same sign on both of the intervals 
(a, b) and (b, ¢). 

Let us assume that f(b) ~ 0; then the sign of function f(x) in the neigh- 
borhood of the point 6 is the same as the sign of the quantity f(b)(x — b). 
Hence it follows that for f(b) < 0 the state of equilibrium b of equation 
(14) is stable and for f(b) > 0 it is unstable. 

2. We shall study the equation 


z = f(z), (17) 


where f(x) is a periodic function with a continuous first derivative. To be 
definite we shall assume that its period is equal to 27. Everything said in 
Example I concerning equation (14) remains valid for equation (17) as 
well, since equation (17) is a particular case of equation (14). However, 
in order to take into account the specific character of equation (17) (the 
periodicity of function f(x)], we shall assume that the phase space of 
equation (17) is not a straight line but a circle K of radius one on which we 
choose a reference point 0 and a direction of motion (for example, counter- 
clockwise). To every number x we make correspond the point & of the 
circle K by marking counterclockwise from the reference point an arc of 
length x (Fig. 24). Then to all numbers x + 2ka, where & is an integer, 
there corresponds a unique point £ on the circumference. Since 


f(x + 2k) = f(x), 


it is possible to set f(t) = f(x), and the function f is then defined on the 
circumference K. Equation (17) now defines the motion of point & along 
the circumference K. If x(é) is a certain solution of equation (17), then 
the point &(t) corresponding to the number x(t) moves along circumference 
K. If ais a point on K such that f(a) = 0, then there exists a solution 
a(t) of (17) such that &() = a and a isa state of equilibrium of (17). Let 
us assume for simplicity that the state of equilibrium of (17) on AK has no 
limit points; then there is only a finite number of points or none at all 
(lig. 25). States of equilibrium divide the circumference into a finite 
system 2 of intervals. If there are no states of equilibrium at all, then the 
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system 2 contains only one “interval” (the circumference). If there is only 
one state of equilibrium a, then the system = also contains only one 
interval, which consists of all points of K with the exception of the point a. 
In the first case the interval has no endpoints at all; in the second case 
both its endpoints coincide. Let J be a certain interval of 2 and z(t) a 
certain solution of (17) with initial values 0, x9, where & is a point of J. 
The solution x(é) is always defined for all values of f, and the point &(£) 
belongs to the interval J. If the interval J has endpoints (one or two), 
then the point traverses J in a fixed direction, the solution &(t) passing 
once through cach point of J. If the interval J coincides with the entire 
circumference, then after leaving the position &, the point will return to 
it after a certain time 7, so that &(0) = &(7). In this case the motion 
&(T) depends periodically with period T on the number ¢. The numerical 
solution x(t) of equation (17) corresponding to the motion £(t) satisfies 


the condition 
a(it+ T) = x(t) + 27. 


From this example it is apparent that it is not always appropriate to 
consider a euclidean coordinate space as the phase space of a system of 
equations; sometimes it is necessary to consider a more complex geo- 
metrical configuration. In Example 3 below we shall encounter this cir- 


cumstance in a more complex situation than in this example. 
3. We shall investigate the system of equations 


t= fi(z',2’), i= 1,2, (18) 


where the function f‘(z!, 2”) is periodic of period 2m in each of its 
arguments: 


fie! + Qka, x? 4+ 2lr) = fi(z',2?), i = 1,2. 


As always, we shall assume that the functions f*(x!, x”) are continuous and 
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have continuous first-order partial derivatives. In view of the periodicity 
of the functions f*(z!, x”) it is reasonable to assume that the phase space 
of (18) is not a plane but a more complex geometrical configuration, 
namely, the surface of a torus or, as it is called, a torus (ig. 26). We shall 
describe this surface. 

In a three-dimensional euclidean space with cartesian coordinates 
Z, Y, 2, we take in the xz-plane the circle K with a radius of one and with 
its center at the point (2, 0,0). We take as origin on the circumference the 
point with coordinates (3,0,0). Then to every number x! we make 
correspond the point £! of K (see Example 2). We now rotate the rz-plane 
about the z-axis in the (2, y, z) space. The surface P described by the cir- 
cumference K in this rotation is a torus. Let £! be some point of K. Asa 
result of rotating the zz-plane through the angle x” measured in radians, the 
point £! goes over into a certain point p of the torus P (Fig. 26). If the 
rotation is made not through the angle x? but through the angle x? + 2kz, 
then we arrive at the same point p on the torus P. Thus the point p on 
the torus P is defined uniquely by two cyclic coordinates t!, &®, and to each 
pair of cyclic coordinates ¢!, £? corresponds one well-defined point on the 
torus. Thus we see that the functions f‘(z!, x”) can be considered as defined 
not on a plane, but on the surface of the torus P: 


fel, P) = fa", 2”). 


Now let x'(é), 27(¢) be a certain solution of system (18). If we make the 
correspondence between the numbers z!(t) and z?(é) and the cyclic co- 
ordinates £1(t) and £7(t), we obtain the point é1(t), £7(t) on the torus P. 
Thus, every solution z!(é), 2?(¢) of (18) can be represented by the motion 
of a point on the torus, the law of motion at each instant being defined by 
that point £1(¢), £7() of the torus through which the trajectory passes at 
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that instant. This is explained by the fact that functions f‘(é!, £?) are 
defined on the torus. Thus the entire torus P is found to be covered by 
trajectories, each two of which either do not intersect, or else coincide. 
In particular, if a trajectory intersects itself, it is then either closed or it is 
a state of equilibrium. 

The representation of phase trajectories of (18) not on a plane but on 
the surface of a torus reflects the specific property of the system (18) 
(periodicity of the functions f') and is a convenient means of studying it. 


16. The phase plane of a linear homogeneous system with constant 
coefficients. In this section we shall construct phase trajectories on the 
phase plane of the system 


11 1,2 
ayx + aox", 


&- 
I 


(1) 
“= arr} + age’, 
or, in vector form, 

k = A(x), (2) 


with constant real coefficients a}. Here we shall have to investigate several 
different cases, since the phase picture of the trajectories of a system 
depends essentially on the values of the coefficients. 

It should be noted that the origin (0, 0) is always a state of equilibrium 
of the system (1). This state of equilibrium is unique if and only if the 
determinant of the matrix (a) is different from zero, or, what is the same 
thing, if both eigenvalues of this matrix are different from zero. 

Let us assume that the eigenvalues of matrix A are real, distinct, and 
nonzero. Then, according to the results of §14 (Theorem 10), any real 
solution of (2) can be written in the form 


x = clhye™! + c7hye!, (3) 


where h, and hg are real, linearly independent eigenvectors of the matrix 
A, \; and dg are its real eigenvalues, and c! and c? are real constants. We 
shall expand (3) in terms of the basis vectors (h;, hg) by setting 


x = §'h, + éhp, (4) 


whence we have 
gl = led! £2 — o2prot, (5) 


Generally speaking, the coordinates £!, £? on a phase plane P of the sys- 
tem (1) are not rectangular; therefore, we shall make an affine mapping 
of the phase plane P onto an auxiliary plane P* in such a way that the 
vectors hy, hy are transformed into mutually orthogonal unit vectors of 
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the plane directed along the axis of abscissas and the axis of ordinates, 
respectively (Irig. 27). The pointx = &'h, + £*h» of the plane P is trans- 
formed by this mapping into a point with rectangular cartesian coordinates 
t', &? in the plane P*. Thus the trajectory defined by the parametric 
equations (5) in the plane P will be mapped into a trajectory (which we 
shall also call phase trajectory) defined by the same equations in the rec- 
tangular coordinates of the plane P*. We shall first plot the trajectories 
defined by the equations (5) in P*, and then we shall map them back 
into P. 

Together with phase trajectory (5) in P* there is a trajectory defined by 


the equations 
q gl _— cle! £2 —_— crt (6) 


and also a trajectory defined by the equations 
t! — cle! ‘a — c7e2!. (7) 


The trajectory (6) is obtained from trajectory (5) by a mirror reflection in 
the axis of abscissas, and trajectory (7) is obtained by reflexion in the axis 
of ordinates. Thus the two mirror images shown leave invariant the picture 
of the trajectories in P*. From this it is evident that, if trajectories are 
drawn in the first quadrant, then it is easy to imagine the entire phase 
picture in the plane P*. 

We shall note that for c! = c? = 0 we obtain the motion of a point 
which describes the state of equilibrium (0,0). For c? = 0, c! > 0 we 
obtain a motion which describes the positive semiaxis of abscissas; for 
c! = 0, c” > 0 we obtain a motion which describes the positive semiaxis 
of ordinates. If A, < 0, then the motion which describes the positive 
semiaxis of abscissas proceeds toward the origin; if \; > 0, then this mo- 
tion is directed away from the origin. In the first case the point approaches 
the origin as closely as we please; in the second, it recedes without bound 
toward infinity. The same is true also of the motion which describes the 
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positive semiaxis of ordinates. If c; and ce are positive, then the motion 
of the point occurs within the first quadrant, without approaching the 
boundary. 

We shall carry out separately a further, more detailed description of the 
phase plane for several cases, depending on the signs of the numbers 
Az and dg. 

(A) Node. Let us assume that both numbers, \; and Ag, are different 
from zero and have the same sign, and that 


Ida] < |Agl. (8) 
We shall investigate first the case when 
A, < 0, Ao < 0. 


Under these hypotheses the motion along the positive semiaxis of abscissas 
is directed toward the origin, just as is the motion along the positive semi- 
axis of ordinates. Further, the motion along an arbitrary trajectory inside 
the first quadrant consists of an asymptotic approach of the point toward 
the origin, the trajectory in this case being tangent to the axis of abscissas 
at the origin. For t - —o, the point moves so that its abscissa and 
ordinate increase without bound, but the increase of the ordinate is more 
rapid than that of the abscissa, i.e., the motion goes in the direction of the 
axis of ordinates. This phase picture is called a stable node [I°ig. 28, (a)}. 
If the inequalities 
1 > 0, A2 > O 


together with inequality (8) are fulfilled, then the trajectories remain as 


118 LINEAR EQUATIONS WITH CONSTANT COEFFICIENTS [cHap. 2 


2 


|N\\\ 


v oe 


\W/ 


Ficure 29 


before, but the motion along them is in the opposite direction, and we have 
an unstable node (Fig. 28, (b)]. 

(B) Saddle point. Let us assume that the numbers A, and \2 have oppo- 
site signs. To be definite we assume that 


1 <0 < do. 


In this case the motion along the positive semiaxis of abscissas is directed 
toward the origin, and the motion along the positive semiaxis of ordinates 
is directed away from the origin. The forms of the trajectories lying inside 
the first quadrant resemble hyperbolas and motions along these trajectories 
proceed toward the origin along the axis of abscissas and away from the 
origin along the axis of ordinates. This phase picture is called a saddle 
point, or a saddle. 

Figures 28(a), 28(b), and 29 give a picture of trajectories on an auxiliary 
phase plane P*. The distribution of trajectories on the phase plane P is 
obtained by means of an affine transformation and depends on the position 
of the eigenvectors (see, for example, Figs. 30 and 31). 

We shall now investigate the case when the eigenvalues of a linear 
matrix A are complex. In this case they are complex conjugates and can 
be denoted by } = w+ iv and X = uw — iv, where vy ~ 0. The eigen- 
vectors of A can be chosen to be conjugate, so that they can be denoted 
by hand h. Let us set 

h = 3(b, — thy), 


where h, and hy are real vectors. The vectors h,; and hy are linearly 
independent, since if they were linearly dependent, then the vectors 
h and h would also be linearly dependent. Thus h, and hy may be assumed 
to form a basis for the phase plane P of equation (2). 
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An arbitrary real solution of (2) may be written in the form 
x = che + che™, (9) 
where ¢ is a complex constant. Let 
b= e+ it? = ce; 


then we have 
x= &h, + (hy. 


Let us map the phase plane P affinely onto the auxiliary plane P* of the 
complex variable ¢ in such a way that the vector h,; goes into 1 and the 
vector hg into z; then to the vector ¢!h, + £hg will correspond the complex 
number ¢ = £! + 7£”. Under this mapping, the phase trajectory (9) will 
be mapped into a phase trajectory on the plane P* described by the 
equation 

t= ce. (10) 


(C) Focus and center. We shall rewrite equation (10) in polar coordi- 
nates by setting 
¢ = pe”, c= Re. 


Thus we obtain 
p= Re“, g=vit+a, 


which is the equation of motion of a point in the phase plane P*. Tor 
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uw ~ 0 every trajectory turns out to be a logarithmic spiral. The corre- 
sponding picture on the plane P is called a focus. If u < 0, then the 
point approaches the origin asymptotically as ¢ increases, describing a 
logarithmic spiral. This is a stable focus [Tig. 32(a)]. If u > 0, then the 
point moves from the origin toward infinity, and we have an unstable 
focus [Fig. 32(b)]. If the number y is zero, then every phase trajectory 
except the state of equilibrium (0, 0) is closed, and we have the so-called 
center (Tig. 33). 


16] PHASE PLANE OF A LINEAR SYSTEM 121 


g2 


we fl 


Figure 35 


Figures 32 and 33 give a picture in an auxiliary phase plane; in the 
plane P the picture is affinely distorted (see, for example, Figs. 34 and 35). 

Above we investigated so-called nondegenerate cases where the roots 
Az and dg are distinct and different from zero. A small variation of the 
elements of the matrix (ai) does not change the general character of the 
behavior of phase trajectories in these propositions. The case of a center 
is an exception: for a small variation of the elements of the matrix (a;) the 
equality «1 = O can be violated, and the center will pass into a stable or 
unstable focus. Because of its importance this degenerate case (of the 
center) is included in the basic text of the section. The remaining de- 
generate cases will be investigated in Examples 1 and 3. 


EXAMPLES 


1. (Degenerate node.) If the matrix A of the system (1) has only one 
eigenvalue A, then there are possible two essentially different cases; in 
describing these cases we shall denote by A the transformation corre- 
sponding to the matrix A. 

Case I. There exists in the plane P a basis hy, hg consisting of two 
eigenvectors of the transformation A: 


Ah, = Ah, Ahy = Aho. (11) 
Case II. There exists in P a basis h,, hy such that 
Ah, = Ah,, Ah, = Ah, + hy. (12) 


The existence of a basis of one of the forms (11), (12) follows directly 
from Theorem 28, but here we shall prove this fact directly. Let h, be 
an eigenvector of the transformation A, and hy an arbitrary vector which 
is not collinear with h,. Then we have 


Ah, = Ah, Ah, = oh, + bho. 
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From this it is evident that the transformation A has, in terms of the basis 


hy, he, the matrix 
» 0 
a wp , 


so that \ and yu are its eigenvalues, and therefore » = dX. If a = 0, then 
the relations (11) are satisfied for the basis h,, ha. If a ¥ 0, then by 
replacing the vector h, by the collinear vector eh;, we obtain a basis which 
satisfies (12). 

By direct inspection we see that in case I the general solution of (2) 
may be written in the form 


x = ce hye + c?hoe™ => Xe. (13) 


This solution has the initial value (0, x9). For \ ¥ 0 every solution 
describes a ray emanating from the origin. For \} < 0 the motion is 
directed toward the origin [Fig. 36(a)] and for \ > 0, away from the 
origin (Fig. 36(b)]; for the case \ = 0, see Example 3. 

By direct inspection it is also seen that in case II an arbitrary solution 
of (2) has the form 


x = chhye™ + c?(hyt + ho)e™. 
If we expand this solution in terms of the basis hy, he, 
x= £'h, + hp, 
we obtain the equations of the trajectories in P in terms of the basis hy, he: 
f= (cl +cte’, g = ce, (14) 


An affine mapping of the phase plane P, which transforms vectors h; and 
he into unit orthogonal vectors directed along the coordinate axes of the 
plane P*, also transforms the trajectories of P into trajectories of P*, 
where the trajectories are already given in rectangular coordinates by (14). 
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We shall investigate the case when \ # O (for the case \ = 0 see Ex- 
ample 3). First let \ < 0. In this case we shall study the trajectories 
filling the plane P*. First of all from equations (14) it is evident that, by 
changing simultaneously the signs of c' and c?, we shall obtain a reflection 
of the plane P* with respect to the origin under which trajectories are 
transformed into trajectories. Thus it is sufficient to examine those tra- 
jectories which fill out the upper half-plane. For c? = 0, c! ¥ 0 we 
obtain two trajectories, one for c! > O and the other for c’ < 0. The 
first one coincides with the positive semiaxis of abscissas and the second, 
with the negative semiaxis of abscissas; the motion along both is directed 
toward the origin. We shall consider the trajectory c! = 0,c? > 0. We 
have 

t} _ c*te™, t? — c7e*, (15) 


For ¢ = 0 we obtain the point (0, ce?) on the axis of ordinates. As ¢ increases 
from zero, the point first moves to the right, then to the left, always 
descending toward the origin to, which it approaches along the trajectory 
which is tangent to the positive direction of the axis of abscissas. As ¢ 
decreases from zero to — oo, the point moves to the left and at the same time 
it ascends less rapidly so that the general tendency of its motion is in the 
negative direction along the axis of abscissas. If in equations (15) the 
constant c? is assigned all positive values, then the trajectories described 
in this manner fill out the entire upper half-plane [Fig. 37, (a)]. We have 
here a stable degenerate node. If \ > 0, then the trajectory is obtained 
from those described by a mirror reflection of the plane in the axis of 
ordinates [Fig. 37(b)], and the motion along them proceeds in the opposite 
direction, i.e., away from the origin of coordinates. This is an unstable 
degenerate node. The phase trajectories in plane P are shown in Fig. 38(a) 
and 38(b). 
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2. Let us study the linear homogeneous second-order equation with 
constant coefficients 
+ aé+ br = 0. (16) 
Replacing this equation with a normal system by the method presented 
in §4, we obtain 
in $4, t=y y= —be—ay. (17) 
The phase plane of system (17) may be taken as the phase plane of equa- 
tion (16). By direct inspection it is seen that the characteristic polynomial 
of (17) coincides with the characteristic polynomial of (16), that is, 


p> +apt+b. (18) 


Thus, if the roots of (18) are complex, then the phase plane of (16) is 
a focus or center. We shall study the phase plane in the case of real, 
distinct, and nonzero roots of (18). Let \ be a root of (18) and h = (h}, h?) 
its corresponding eigenvector. We have then, taking into account the form 


of system (17), h? = dhl 


so that the characteristic direction corresponding to the eigenvalue X is 
determined by a straight line having the equation 


y = rz; 


we shall call it the characteristic line. 

If the roots \, and Ap» are negative, then we have a stable node [see (A)]. 
In this case both characteristic lines pass through the second and fourth 
quadrants; the trajectories in the neighborhood of the origin are tangent to 
whichever of these lines is closer to the axis of abscissas [Fig. 39]. 

If \; and Apo are positive, then we have an unstable node [see (A)]. Both 
characteristic lines pass through the first and third quadrants; in the 
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neighborhood of the origin the trajectories are tangent to the line closer to 
the axis of abscissas (Fig. 40). 

If \; and dg have different signs, then we have a saddle; one characteristic 
line passes through the second and fourth quadrants and the other through 
the first and third quadrants. In the direction of the first of these lines, 
the trajectories approach the origin, and in the direction of the second, 
they recede from the origin (Fig. 41). 

3. We shall study, finally, the case when at least one of the eigenvalues 
of the matrix A is zero. 

Case I. Only one eigenvalue, 


1 #0, w=), 
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is zero. In this case the solution can be written in form (4), where £' and 
£? are defined by the formulas (5). Since \2 = 0, then ¢? = const, and 
the motion takes place along the straight line &? = const in the direction 
of the line £1 = 0 or away from it depending on the sign of the number A,. 
All points of the line 1 = 0 are states of equilibrium (T'ig. 42). 

If there exists only one eigenvalue A; = A2 = O, then the two cases in 
Example 1 can occur. 

Case I [see (11), = 0]. The general solution may be written in the 
form [see (13)] 

xX = Xo. 


This case occurs if all coefficients of (1) are zero; every point of the plane P 
is a state of equilibrium. 
Case II [see (12), = 0]. The general solution may be written in the 
form [see (14)] 
gl = cl 4 c%, 2 — 2, 


The motion takes place uniformly along each of the straight lines &? = 
const. All points of the line &? = 0 are states of equilibrium (Tig. 43). 


CHAPTER 3 
LINEAR EQUATIONS WITH VARIABLE COEFFICIENTS 


In this chapter we develop the theory of linear equations first for a 
normal nth-order system and then for one nth-order equation; almost all 
results are related to the equation derived from the corresponding results 
for the normal system, The third section of this chapter is devoted to 
normal linear homogeneous systems with periodic coefficients. The basic 
result here is Lyapunov’s theorem on transforming a normal system with 
periodic coefficients into a normal system with constant coefficients by 
means of a linear periodic transformation of the variables. Later this 
result will find an important application in stability theory. Its proof is 
very simple, but is based on a comparatively nonelementary part of the 
theory of matrix functions. This theory, which is not a part of the theory 
of ordinary differential equations, is presented for the convenience of the 
reader in the last chapter of this book, which is a purely algebraic supple- 
ment. Thus the third section (§19) of this chapter is nonelementary be- 
cause it contains matrix calculus. The material of this section is, ap- 
parently, presented for the first time in an undergraduate textbook. 


17. The normal system of linear equations. We shall study here the 
normal system of linear equations with variable coefficients 


= Vaidr’ +o, i=1,...,n. (1) 


j=1 


We recall that if gq: < ¢ < qe is the interval where the coefficients a}(t) 
and the free terms b'(£) of system (1) exist and are continuous; then, by 
Theorem 8, the interval gq, < ¢ < qe is the maximum interval of existence 
of every solution of (1). In the sequel we shall assume that every solution 
to be considered is defined on this interval and that every value ¢ considered 
belongs to it. 

The fundamental system of solutions. First of all we shall study the 
homogeneous system of equations 


i= > ai(t)z’, i= 1,...,%, (2) 


j=1 


or, in vector form, 
x= A(é)x. (3) 
127 
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(A) We shall establish the simplest properties of equation (3). (a) If 
x = g(t) is a solution of (3) which vanishes for a certain value fo, i.e., 


e(to) = 0, (4) 
then this solution is identically zero: 
p(t) = 0, g <t< q@. 
(b) If the vector functions 


vill), golt),..., er(t) 


are solutions of (3), then the vector function 
e(t) = cpi(t) + +++ + cg,(t), 


where c!,...,c” are constants, is also a solution of (3). 

Property (b) can be verified directly. Property (a) follows from the fact 
that the vector x = 0, which is identically zero, is obviously a solution 
of (3), and therefore the solution g(#), specified in (a) as having common 
initial conditions (4) with this solution, must coincide with it. 

(B) Let 

vill), go(t),..-, r(t) (5) 


be a system of solutions of (3). It is called linearly dependent if there exist 
constants c!, c?,...,¢’, not all simultaneously zero, such that 


clei (t) + c*pelt) +--+ + ce, (t) = 0. 


Otherwise, the system (5) of solutions of (3) is called linearly independent. 
It follows that if for even one value t = fo the vectors 


vilto), e2(to),.--, Prtto) (6) 


are linearly dependent, then the solutions (5) are linearly dependent. In 
other words, if the system (5) is lmearly independent, then for no value 
of ¢ can the vectors (5) be linearly dependent. 

We shall now prove this. Let us assume that the vectors (6) are linearly 
dependent, i.e., that 


clpi(to) +--+ + cpr(to) = 0, 
where not all the numbers c!,..., ¢” are zero. Let us set 


p(t) = clo: (t) +--+ + ee, (i). 
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By proposition (A) the vector function g(t) is a solution of (3), and is 
also identically equal to zero, since it vanishes at the point 1 = fo. 

We now turn to the concept of a fundamental system of solutions, which 
is most important for homogeneous linear systems. 

(C) The system of solutions of equation (3) 


¢1(4), v2(t), sty ¢n(t), (7) 


where n is the order of the system (2), is called a fundamental system of solu- 
tions if it is linearly independent [see (B)]. It turns out that (a) a funda- 
mental system of solutions always exists for the equation (3), and (b) if 
(7) is a fundamental system of solutions of (3), then every solution g(t) 
of (3) can be represented in the form 


elt) = epi (t) + c?go(t) +--+ + c*pn(t), (8) 
where c},..., ¢” are suitably chosen constants. (For a linear system with 
constant coefficients, the fundamental system of solutions was constructed 


in §14). 
We shall prove first of all that a fundamental system of solutions of 
(3) exists. Let 
@ 1, 22,..., an 


be an arbitrary system of constant, linearly independent vectors, where 
n is the order of (2). We shall define solutions (7) by the initial conditions 


Pi(to) = a, t= 1,...,%, 


where fg is a certain value of ¢. Since the vectors ¢1(éo), go(to), . - . On(to) 
are by assumption linearly independent, then proposition (B) implies that 
the solutions (7) are also linearly independent, i.e., they form a funda~ 
mental system. 

* We shall show that every solution g(t) can be written in the form (8). 
Let to be a certain instant of the time ¢; since the solutions (7) are linearly 
independent, the vectors gi(fo),..., @n(to) are also linearly independent 
[see (B)], and since their number is equal to the dimension of the vector 
space considered, they form a basis for it, so that the vector y(t9) may be 
written in the form 


(to) = c'pilto) +--+ + c"pnlto), (9) 
where the numbers c’,...,c” are suitably chosen. The solutions ¢(t) 
and clg,(t) +--+++ c"gn(t) have common initial conditions [see (9)] 


and therefore coincide, so that (8) is valid. 
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We shall now proceed to a description in coordinate form of the facts 
obtained and to the determination of certain other results. 
(D) Let 
¢1(4), re) ent) (10) 


be a certain system of solutions of equation (3). Let us express the solu- 
tion g,(¢) in terms of coordinates by setting 


ex(t) = (ve(f), veld), ..., ek (2). 
We form the matrix 
vill) --- get) «++ gald) 
ei(0) + gk) +++ eh], (11) 
ei) RD > OX 


where the solution y;(t) of (2), or more precisely, its coordinates, serve as 
the kth column. We denote the determinant of this matrix by W(i); it is 
called the Wronskian determinant of the system of solutions (10). It is 
evident that, if the solutions (10) are linearly independent, then the 
Wronskian W(t) cannot vanish for any value of ¢; in this case system (10) 
is a fundamental system of solutions. Furthermore, if (10) is linearly de- 
pendent, the Wronskian is identically zero. Whenever the system (10) is 
fundamental, we shall call the matrix (11) a fundamental matriz. 

We shall now prove that any given nth-order square matrix, consisting 
of functions of ¢ and satisfying certain natural conditions, is fundamental 
for a certain system of equations of the form (2). 

(E) We shall assume that the matrix (11) is an arbitrary given matrix 
of functions of ¢ which are continuously differentiable on the interval 
a1 < ¢ < qe, with a determinant which does not vanish on this interval. 
Then (11) is fundamental for some (unique) system (2) which is defined 
on the interval gq; < ¢ < qo. 

To prove this, we shal] write out the statement that the vector function 
y(t), whose coordinates form the kth column of the matrix (11), con- 
stitutes a solution of (3). We have 


a = Nadel, ik=1,...,0. (12) 


t=1 


If in this relation the index 7 is fixed and only the index k is considered to 
be variable, then the system of relations obtained can be taken as a system 
of linear algebraic equations in the unknowns aj(t),..., @,(t). This sys- 
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tem can be solved uniquely, since its matrix is obtained by a transposition 
of the matrix (11) so that its determinant does not vanish. Thus for every 
fixed 7 the functions ai(t) may be determined uniquely from (12). They 
then become continuous functions since the functions ¢i(#) and i(t) are 
continuous. 

Liouville’s formula. To prove proposition (G) we need a formula for 
the differentiation of a determinant, which we shall derive here. 

(F) Let (¢;(¢)) be an nth-order square matrix whose elements are 
differentiable functions of the variable £, and let W(é) be the determinant 
of this matrix. The derivative W(t) of this determinant may be found 
from the following formula: 


Wi) = Wi) + --- + W2, (13) 


where the term W,,(t) in the 7th place on the right-hand side of the equality 
is defined in the following way. In the matrix (¢i(¢)) all elements of the 7th 
row are differentiated with respect to ¢, the other rows remaining unchanged 
and the determinant of the matrix obtained being denoted by W,,(t). 
is evident that the role of the rows and columns may be interchanged. 
To prove (13), we first consider the determinant U of the nth-order 
square matrix (u‘) as a function of all the elements uj, i,j = 1,...,n 
of this matrix, and we assume these elements to be independent variables. 
Let us calculate the partial derivative 


aU 


our 
3 


of the function U with respect to the variable u;; here r and s are fixed. 
We denote the cofactor of the element u; in the matrix (w;) by V2, so that 


U = >> ujVi. (14) 


This formula gives us the expansion of the determinant U in terms of 
elements of the rth row. The cofactor V? does not depend on the variable 
us so that by differentiating (14) with respect to ui, we obtain 


aU _ ys. (15) 


our 
3 


If we set ui = ¢5(t), then we have U = W(t). Differentiating W(t) as 
a composite function, we obtain by (15) 


WO = FeO = Lsovi = > (2 #0 vi). 
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Since it is obvious that 


>) ivi = Wild, 


j=1 


formula (13) is proved. 

We now turn to the proof of the so-called Liowville formula. 

(G) Let W(é) be the Wronskian of a fundamental system of solutions 
of equations (2); then the formula 


t 
W(t) = W(t) exp @ S(r7) ar) (16) 
is valid, where 


S(t) = a(t) + az) +--+ + an(t). 


To prove (16), we shall introduce the differential equation which the 
Wronskian satisfies. 

We shall calculate the derivative W(t) of the Wronskian by means 
of (13). In order to simplify the calculations, we shall assume the rows 
of (11) to be vectors; that is, we assume that 


x'(t) = (i,..., ea), f= 1,..., 0. 


The relation (12) may now be written in the form 


X(t) = ai(t)xi(t) +--+ + ab (x7 (0), (17) 


which shows that the derivative of the 7th row of the matrix (11) is a linear 
combination of rows of the same matrix. Thus, in calculating the de- 
terminant W;(t), we must replace the ith row of the determinant W(t) by 
a linear combination (17) of rows of the same determinant. Since adding 
multiples of the other rows to the given row does not change the value of 
the determinant, the determinant W(t) may be obtained from the de- 
terminant W(t) by multiplying its ith row by ai(¢), so that we have 


Wit) = QW. 

Thus by (13) we obtain 
W(t) = S(t)Wit). 

The unique solution of this equation with the initial condition 
W()ltaty = W (to) 


is formula (16). Thus Liouville’s formula is proved. 
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The method of variation of parameters. We turn now to the study of non- 
homogeneous systems. Let 


y = Ay + bY) (18) 


be the vector form of the nonhomogeneous system (1), and let y = y(t) 
be a certain solution of this equation. Together with equation (18) we 
shall study the corresponding homogeneous equation (3). From §6 it 
follows directly that an arbitrary solution of (18) may be written in the 


form 
y= +, 


where ¢(é) is an arbitrary solution of (3). 

Thus the solution of the nonhomogeneous equation (18) reduces to the 
solution of a homogeneous equation and to the determination of a par- 
ticular solution of the nonhomogeneous equation. We shall show how, by 
knowing the fundamental system of solutions of the homogeneous equa~- 
tion (3), we can find by means of quadratures a particular solution of the 
nonhomogeneous equation. 

(H) (Method of variation of parameters.) Let 


#1(2), vty On(E) 


be a fundamental system of solutions of the homogeneous equation (3). 
We shall seek a solution of (18) in the form 


y = cl(t)gi(t) + «++ + e*(é)gn(t), 


where the coefficients are indeterminate functions of ¢. Substituting this 
value y into (18), we obtain 


é(t)oi(t) +++ + e*(en(t) + coil) + +++ + eM eGn(t) 
= A(i)[el(ei() +--+ + e*(en()] + dM), 


whence, remembering that g(t), ..., @n(é) are solutions of (3), we obtain 
EMei(t) + +++ + een) = b(t). (19) 


Since 9j(t),..., x(t) are linearly independent vectors at every point ¢, 
it follows from (19) that the quantities é'(z),..., é"(#) are uniquely de- 
termined, and therefore the values c'(é),..., c"(£) may be found by in- 
tegrating. Equation (19) in the quantities é'(t),..., é"(t) may be written 
in terms of the coordinates and has the form 


n 


De iO = b'O, t= 1s ym 


j=1 
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Matrix form of systems of linear equations. In a number of cases it is 
preferable to write equation (3) in its matrix form, where the fundamental 
matrix of (3) is the unknown value. We shall give this form here. 

(1) If (7) is a fundamental system of solutions of (3), then 


eit) = > da(t) 95 (t). 


a=] 
This relation takes the matrix form 
&(t) = A(t)&(2), (20) 


where &(t) is the derivative of the fundamental matrix ®(t) = (yi(t)) 
with respect to time ¢, that is, (2) = (¢3(¢)). Thus the fundamental matrix 
(tf) of (3) satisfies the matrix equation (20); in addition, every solution of 


the matrix equation ; 
xX = A(X, (21) 


where X is an unknown matrix, is the fundamental matrix of (3) only if 
the determinant of X is not zero. Hereafter, by a solution of equation (21) 
we shall mean only a matrix X which satisfies (21) and whose determinant 
is different from zero. It is obvious that finding one solution of (21) is 
equivalent to finding all solutions of (3). We note that if X = (¢) and 
X = &(2) are two solutions of (21), then there exists a constant matrix P 
such that 

&(t) = &()P. (22) 


Let us prove this last statement. If 


&(t) = (vi), 
&(t) = (439), 
e;(t) = (¢}),..., 7), 


oi) = (#30, ..., HO), 
then 
oi (8), sey Gril) (28) 


is a fundamental system of solutions of (3), and since $;(£) is also a solution 
of (3), it can be expressed in terms of the fundamental system (23), 
so that we have 


oilt) = Do pivalt). 
a=] 
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If we rewrite these relations in scalar form, we obtain 


ei) = D> ipl. (24) 


a=1 


The relation (22) represents the matrix form of (24) for P = (pi). 
(J) We shall introduce a new unknown vector y in equation (3) by 


using the transformation 
y = SOx, (25) 


where S(t) = (si(¢)) is a nonsingular matrix which depends on t. The 
equation for the new unknown vector function y has the form 


y= (S(t) + SOAW)S Wy, (26) 


i.e., we obtain again an equation of type (3). To the transformation (25) 
of the vector variable corresponds the transformation 


Y = S()X (27) 


of the matrix variable [see (I)]. 
We shall first derive equation (26) for the unknown y. We have 


y= < (Sx) = Sx + Sx 
= (S(t) + SHAW)x = (SH + SMA)S" Oy. 


In order to show that the transformaticn (26) of the matrix variable corre- 
sponds to the transformation (25) of the vector variable, we shall rewrite 
(25) in scalar form 
y= > si(t)z”. 
a=1 
By means of the formula 


nr 


Vi) = YS ss(He7) 


a=] 


the transformation (25) sets up a correspondence between the vector 
vi(t) = WO,..-,W@) and the vector 9;(t) = (¥(,..., ef) of 
the fundamental system (3). Thus the fundamental matrix W(t) of (26), 
given by the formula 

v(t) = SH, 


corresponds to the fundamental matrix ®(é) of (3), so that the matrix 
indeterminate transforms according to formula (27). 
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EXAMPLE 


Irom proposition (C) it is clear that in order to find all solutions of equa- 
tion (3) it is sufficient to find its fundamental system of solutions which 
contains n linearly independent solutions. We shall show that, if we know 
one nontrivial solution of (2), then we can lower the order of (2) by one, 
i.e., we can reduce the problem to the solution of a linear system of order 
n—1. Let 

e(t) = (¢',..-, e"@) 


be a solution of (3) or, what is the same thing, of (2). We seek a solution 
of (3) in the form 
x= up(t) + y, (28) 


where u is an unknown function and y an unknown vector, whose first 
component we can assume to be zero: 


y= (0,y”,...,y”). 
Substitution of the vector x from formula (28) into (8) gives 
u(t) + up(t) + y = Alé)(ue() + y). 
Since ¢(¢) is a solution of (3), we obtain 
tpt) + y = Alby. 


We express this equation in terms of the coordinates, writing separately 
the first of the equations so obtained: 


ty (t) = > a;(t)y’, (29) 
j=2 
y= > ai(t)y — a(t), t= 2,...,n. (30) 


j=2 


If we determine % from (29) and substitute the value obtained into (30), 
we have 


y= DVoOy, i=2,...,2, (31) 
j=2 


where ‘ () 
BO = a) — Fay ald 

It must be remembered that substitution of the quantity « from (29) 
into (30) is possible only on the interval where the function ¢'(t) does not 
vanish. Now if y(t) = {y7(t),..., ¥”(2)} is any solution of (31), then, 
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by determining u from the relation 
n . 
tet) = DP Ow’) 
j=2 
by means of quadratures, we obtain a solution of the original system (2) 
in the form 


x= u(t) + ¥(é). 


18. The linear equation of nth order. Here we shall study the nth- 
order linear equation 


y™ +4 ay(t)y@—P +++ + any = v0), (1) 


whose coefficients a;(#) and free term b(t) we assume to be defined and 
continuous on the interval gq; < ¢ < ge. In our study of equation (1) 
we shall reduce it to a normal system of linear equations by the method 
indicated in §4. 

The fundamental system of solutions. (A) In order to reduce equation (1) 
to a normal linear system, we introduce the new unknown functions 


gp=y w=y ..., may, 

These new unknowns z!,..., x” satisfy the linear system [see §4, (A)] 

#! = 2%, 

= 2°, 

gt = gt 

t” = —a,(t)x* — an—s(t)z” — +--+ — ay()z" + dd). 

We write the system obtained in vector form 
x = A(i)x + bid), (2) 


where the matrix A(t) has the form 


0 1 0 
0 0 1 0 
0 0 0 0 

, (3) 
0 0 0 1 


—a,(f) —dn_i(t) —@n—2(t) —ay(t) 
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and the vector b(t) is defined by the formula 
b(t) = (0,0,..., d(2)). 


Equations (1) and (2) are equivalent; that is, to every solution y = y(t) 
of (1) corresponds the solution 


x= eo) = WO, VO,-..., ¥P() 
of (2), and conversely, to each solution 
x= 9) = (o'), 7(0,---, 9") 
of (2) corresponds the solution 
y = ¢'() 


of (1), this correspondence being one-to-one. If the solutions ¥(é) of (1) 
and g(t) of (2) correspond in such a way, then we write 


¥) = ed). 


In particular, it follows from the equivalence of equations (1) and (2) 
that the maximum interval of existence of every solution of (1) is the entire 
interval gq; < ¢ < qe (see Theorem 3), so that henceforth we may assume 
that every solution of (1) under consideration is defined on this interval 
and every value ¢ under discussion belongs to it. 

We shall study first the homogeneous equation 


y™ 4 ay(t)y@—- + ++. 4+ an(t)y = 0. (4) 


Let 
x= A(d)x (5) 


be a system of equations in vector form corresponding to (4), where the 
matrix A(é) is defined by formula (3). 
(B) Let 
¥i(t), walt), nny ¥7(t) (6) 


be a certain system of solutions of equation (4). It can be verified directly 
that the function 
W(t) = c'Wilt) +--+ + c%G,(2), 


where c',..., c” are constants, is a solution of equation (4). The system 
(6) is called linearly dependent if there exist constants c’,..., c’, not all 
zero, such that 
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Thus if 
vi(t),.-., er(t) (8) 


are the solutions of (5) corresponding to the solutions (6): 
¥:(t) 2 g(t), i= 1, reg Ty 


[see (A)], then the solutions (8) are linearly dependent [see §17, (B)] if 
and only if the solutions (6) are linearly dependent. 

We shall prove this. Let us assume that the solutions (6) are linearly 
dependent so that equation (7) is valid. Writing out (7) and the relations 
obtained from it by differentiation, we obtain 


cWil) +---+cy,() =9, 
ca (t) a c(t) = 0, (9) 


WTP +o + eV MD = 0. 
If we remember that 
eit) = Wil, iO, -.-, 27), 


we see that the vector form of (9) is 
c'gi(t) +-+-+ c'¢,(t) = 0, (10) 


so that a linear dependence also exists between the solutions (8). Let us 

assume, conversely, that the solutions (8) are linearly dependent so that the 

relations (10) are valid. By replacing every vector ¢,(t) of (10) by its first 

component, we obtain (7), so that the solutions (6) are linearly dependent. 
(C) The system of solutions 


¥1 (2), see , Wnt) (11) 


of equation (4) is called a fundamental system if it is linearly independent. 
[The notation indicates that the number of solutions of (11) is equal to 
the order of (4).] Thus we find that fundamental systems of solutions (4) 
exist and that if (11) is fundamental, then every solution (4) can be written 
in the form 


v(t) = clvi(t) + +++ + eat), 


where c!,...,c” are constants. It is now clear that in order to find all 
solutions of (4) we need only find its fundamental system of solutions. 
(The fundamental system of solutions for a linear homogeneous equation 
with constant coefficients was constructed in §7 and §8.) 
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We shall first show that the fundamental system of solutions of (4) 
exists. In order to do this we shall utilize the existence of a fundamental 
system of solutions of (5) [see §17, (C)]. Let 


ei(t),..-, ent) (12) 
be a fundamental system of solutions of (5), and let 
vit), sey ¥nl(t) (13) 


be the solutions of (4) which correspond to the solutions of (12): 
¥i(2) 2 gilt), i= 1, rey tt 


[see (A)]. Since the solutions (12) are linearly independent, it follows from 
(B) that the solutions (13) are also linearly independent, so that they 
constitute a fundamental system. Let us now assume that the system (11) 
is fundamental for (4). Let the solutions (12) correspond to the solutions 
(11), and let ¥(¢) be an arbitrary solution of (4) and g(t) the correspond- 
ing solution of (5). Since, by assumption, the system (11) is fundamental, 
i.e., linearly independent, the corresponding system (12) is also linearly 
independent, i.e., fundamental. Thus, it follows from (C) of §17 that 


o(t) = clpy(f) +... + ce" ea(t). 
If we replace each vector in this system by its first component, we obtain 
v(t) = clyi(t) +--+ + call), 


so that (C) is proved. 
(D) The determinant 


V1 ‘) Vn (t) 
Wi) = no a(t) (14) 
YEP ee PPD 


is called the Wronskian determinant of the system of solutions (11) of 
equation (4). If the solutions (12) of (5) correspond to the solutions (11) 
[see (A)], then it is obvious that the Wronskian [see §17, (D)] of the system 
of solutions (12) of (5) coincides with (14). Thus, what is true for the 
Wronskian of (12) is also true for (14). Hence, by (D) of §17, we conclude 
that the determinant (14) either does not vanish at any point or else it 
vanishes identically; in order that the system of solutions (11) be linearly 
independent, i.e., fundamental, it is necessary and sufficient that (14) not 
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vanish. From formula (16) of §17 we can obtain the Liouville formula for 


determinant (14) ; 


W(t) = W(to) exp (-/ a,(7) ar) , (15) 


since the trace, i.e., the sum of the diagonal elements of the matrix (3), is 
equal to —a,(t). Below, in Example 2, we shall give a simple direct proof 
of (15). 
(E) Let 
2z™ + a,(t)2"—P +--+ + an(t)e = dt) (16) 


be a nonhomogeneous equation and let 
y + ay"? +--+ + any = 0 (17) 


be its corresponding homogeneous equation. Irom the propositions of 
§6 it follows directly that if Xo(¢) is a particular solution of (16), then an 
arbitrary solution of (16) has the form 


z= ¥(t) + Xol(t), 


where y(t) is a solution of (17). 
(F) (Method of variation of parameters.) Let 


be any fundamental system of solutions of equation (17). Then a solution 
of (16) can be obtained in the form 


z= ey) + +++ + c™()dn(f), (19) 


where the functions 
é'(t),..., e*(t) (20) 


are obtained as solutions of the system of algebraic equations: 


Wied) +--+ +e) =O, 
diet) +++ + va(der(t) 0, 


(21) 
PP MED +--+ + VE? HE = 0, 
ye pel(t) feee vo—Pe(t) = b(t). 


Since the determinant of the system of equations (21) with respect to the 
indeterminates (20) is the Wronskian of system (18), it follows from (D) 
that it does not vanish for any value of ¢; therefore, we can find the quanti- 
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ties (20) from (21) and then integrate these quantities to determine the 
required functions 


cl(t),..., e*(2). 

The proof of (F) follows froma (H) of §17. It can also be derived directly, 
which we shall proceed’ to do. Assuming that the quantities (20) are 
restricted by the conditions (21), we obtain the following relations by 
differentiating formula (19): 


z=c(Wil) +:---+c"byr(, 
cl(\vi(t)  +---+c"(ynlt), 


ae 
I 


Z—D = MOVED) +--+ OWE, 
= OOYPO +--+ OHO 
+ POPP +o + BOW? 
CWP) +--+ + (yn? @) + dM). 


Substituting these expressions into (16), we obtain an identity. Thus, if 
the functions c!(t),..., c*(é) satisfy (21), then the function (19) is a solu- 
tion of (16). 


EXAMPLES 


1. If a nontrivial (not identically zero) solution #(é) of equation (4) is 
known, then the order of this equation can be decreased by one, i.e., its 
solution can be reduced to the solution of a linear equation of order 
n — 1. In order to do this we make the substitution 


y = v(t), (22) 


where » is a new unknown function. We shall now show that the result 
of substituting (22) into the left-hand side of (4) leads to the equation 


bo(t)u'™ + by (uP +--+ + b,_1()6 + b,(Hv = 0 (23) 


for v, where 

bolt) = ¥@), bnr(t) = 0. (24) 
Since our whole investigation is correct for an nth-order equation in which 
the coefficient of the nth derivative is one, it is necessary to divide (23) by 
bo(f) = W(), so that (4) reduces to (23) only on an interval where y(t) does 
not vanish. If we set 


bit) 
vy — HO 
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and replace the unknown function v by a new unknown function 
w= d, 
we arrive at an equation 
we—Y + hw? + +--+ lhi(w = 0 


of order n — 1. If a solution X(t) of this equation exists, then we obtain 
the solution v of (23) by quadrature: 


v = Jx(d dt, 


and the solution y of (4) may be obtained by substituting this function » 
into (22). 

We shall prove that the substitution (22) reduces (4) to the form (23), 
while preserving the relations (24). Differentiating (22), we obtain 


y = vot ++, 


where terms containing derivatives of v of orders lower than k are not 
written. Therefore equation (4) takes the form (23), where bo(t) = y(t). 
Since ¥() is a solution of (4), v = 1 is a solution of (23). Substituting 
v = 1 into (23), we obtain b,(¢) = 0. Thus the relations (24) are proved. 

2. We shall prove Liouville’s formula (15) for one nth-order equation 
without using Liouville’s formula for a system [see formula (16), §17]. 
Here we shall use the differentiation rule for a determinant given in §17 
[see §17, (I*)]. Applying this differentiation rule to (14), we obtain 


WO = Wi +--+ Wi) +--+ + Wald, 


where W,(t) is the Wronskian W(t) in which the 7th row is differentiated. If 
4 <n, then, as a result of differentiating of the 7th row, we obtain a row 
coinciding with the (¢ + 1)th row of the determinant W(é), and therefore 


we have 
Wit) = Wot) = +++ = Wr_ilt) = 0. 


By differentiation of the nth row we obtain the row 


WIPO), .. va (8, 


which by (4) is a linear combination of rows of W(t), the nth row being 
taken with the coefficient —a,(é). Since the determinant W,(t) contains 
rows with the indices 1, ...,” — 1 of the Wronskian, these rows can be 
rejected in the linear combination, leaving only the nth row with the 
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coefficient —a,(t). Thus for the Wronskian we obtain the differential 
equation 


W(t) = —a (QW), 
the solution of which gives us Liouville’s formula (15). 


19. The normal linear homogeneous system with periodic coefficients. 
Among linear equations with variable coefficients a particularly important 
role is played by equations with periodic coefficients. The present section 
is devoted to an account of some of the properties of normal linear ho- 
mogeneous systems of differential equations with periodic coefficients. 
Lyapunov’s theorem is the most essential of these properties. The proof 
of Lyapunov’s theorem is given here, but it is less elementary than any of 
the previous developments in this book. It is based on the matrix calculus, 
the necessary details of which are presented in the last chapter of the 
book. 

Let 

X = A(X (1) 


be a normal linear homogeneous system of equations written in matrix 
form [see §17, (I}]. We shall assume that the coefficients of this system are 
periodic functions of time t with period 7, i.e., the matrix A(t) satisfies the 
condition 

A(é+ 7) = A(é). 


(A) For any (matrix) solution 
X = &(t) (2) 


of equation (1) [see §17, (I)], a constant (nonsingular) matrix C can be 
found such that 
ot +7) = 0)C. 


We shall call the matrix C the fundamental matrix for the solution (2). If 
X = &() is any other solution of (1), and C is its fundamental matrix, 


then we have . 
C = P~'CP, (3) 


where P is a certain nonsingular constant matrix. 

To prove the existence of the matrix C we observe that, together with 
the solution (2), the matrix #(f + 7) is also a solution of equation (1). 
In fact, 


ét¢+7) = Att + r)ot+7) = ASE +7). 
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Thus by formula (22) of §17, we have 
&(t +7) = &(0)C, 


where C is a constant matrix. 
To prove formula (3) we shall also utilize formula (22) of §17. Because 
&(é) is a solution of equation (1), it follows from the formula above that 


&(t) = S()P. 
Hence 
&(t+7) = &t+7)P = 0)CP = PCP, 


which also gives (3). 
(B) Equation (1) and the equation 


Y = BY (4) 


with a periodic matrix B(t) of the same period 7 as the matrix A(t) are 
called equivalent if there exists a linear transformation 


Y = S@)X 


{see §17, (J)] with a periodic matrix S(t) of period 7 which transforms (1) 
into (4). Hence formulas (1) and (4) are equivalent if and only if there 
exist solutions X = $(¢) and Y = W(t) of these equations with the same 
fundamental matrix. 

Let us prove this assertion. Let us first assume that (1) and (4) are 
equivalent. Let X = (t) be an arbitrary solution of (1) with afunda- 
mental C; then Y = W(t) = S(t)@() is a solution of (4), and we have 


Wit) = S¢4+7e¢+ 7) = SHOOt+ 7) = SHEOC = VOC. 


Thus the fundamental matrix C of solution ®(t) is also fundamental for 
the solution Y(é). 

We shall now assume that the solutions X = #(f) and Y = W(é) of (1) 
and (4) exist with the same fundamental matrix C; then we have 


eti+7) = BC, VWitr) = VOC. 
Dividing the second of these relations by the first, we obtain 
Witte t¢+7) = VOS ). 


Thus the matrix 
S(t) = Ve") 
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is periodic with period r, and we have 
Ya) = SM). (5) 


Since each of the solutions ©(t) and W(t) is uniquely determined [see §17(E)], 
it follows from (5) that equation (4) is obtained from (1) by means of a 
transformation with matrix S(t). 

As is evident from (A) and (B), to every equation of the form (1) which 
is considered unique up to an equivalence [see (B)], there corresponds a 
matrix C determined uniquely up to a transformation [see (3)]. In addition, 
the set of all invariants of the matrix C with respect to transformations of 
the form (3) constitutes a complete system of invariants of equation (1), 
which is uniquely determined up to an equivalence. 

It should be noted that everything said in (A) and (B) is true both for 
the case of real matrices and for the case of complex matrices. In the fol- 
lowing important theorem of Lyapunov (Theorem 12), we shall distinguish 
between real and complex cases. 


THEOREM 12. Any equation (1) is equivalent [see(B)] to the equation 
Y = BY, 


where B is a constant matrix. (The matrix B, generally speaking, is 
complex.) If in equation (1) the matrix A(é) is real with period 7 then 
this equation, considered as periodic with period 27, is equivalent to the 


equation ; 
Y = BiY, 


where the matrix B, is constant and real, and the matrix S(¢) taking (1) 
into Y = B,F is also real. 


We shall preface the proof of Theorem 12 by the following proposition: 
(C) Let ; 
Y = BY (6) 


be a system of linear homogeneous equations with constant coefficients 
written in matrix form. Here the matrix B is constant. It turns out that 


the matrix 
Y = 8 (7) 


[see (D), §33] is a solution of (6). 
To prove that (7) is a solution of (6), we shall write out the function 
e'? explicitly in the form 


e é 
PS E+IB+tS B+ B+... 
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Hence for the derivative (d/dt)e'? we obtain 


d o1B ? ae _..\— pt8 
ai’ =B(e+ m+ h wy = Be. 
Proof of Theorem 12. Let C be the fundamental matrix of some solution 
X = &(t) of equation (1). By (D) of §33 there exists a matrix B which 
satisfies the condition 
eF — C. 


We shall prove that equations (1) and 
Y = BY (8) 


are equivalent. In fact, proposition (B) implies that the matrix Y = e! 


is a solution of (8). Thus, if (8) is considered an equation with periodic 
coefficients with period 7, then the fundamental matrix of the solution 
Y = e'* is C, that is [see formula (20) of §33] 


QttnB _ otBTB _ obo. 


Since the fundamental matrices of the solutions of equation (1) and of 
equation (8) coincide, these equations are equivalent [see (B)], and the 
first. part of Theorem 12 is proved. 

We shall now assume that A(t) is a real matrix, (ft) is a certain real 
solution of equation (1), and C is the fundamental matrix of this solution, 


so that 
Ot +7) = SC. (9) 


Since #(t) is a real, the matrix C is also real. From (9) it follows that 
&(t + 27) = H+ NC = B~C?. (10) 
By (D) of §33 there exists a real matrix B,; which satisfies the condition 
eri — 02. 
We shall prove that equation (1) and the equation 
Y= BY, (11) 


considered as equations with period 27, are equivalent. Actually, the 
matrix e’#: is a solution of (11), so that if (11) is considered as an equation 
with periodic coefficients of period 27, then the fundamental matrix of the 
solution Y = e'?1 is C?. Since the fundamental matrices of the solutions 
of (1) and (11) coincide [see (10)], these equations must be equivalent. 
Theorem 12 is thus proved. 
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(D) Let C be an arbitrary nth-order square matrix, of which the absolute 
value of all the eigenvalues is smaller than a certain positive number p. 
The elements of the matrix C™, where m is an integer, will be denoted by 
™ so that C™ = (™ci). Then there exists a positive number 7, which 
does not depend on 7, j, m, such that 


\"cj] < rp”. (12) 
Hence, in particular, it follows that for an arbitrary vector x the inequality 
[C™x| < n?rp™|x| (13) 


is valid. In order to obtain the bound (12) we consider the series 


2 m 
2,2 z 
=] f 4 Ff wee aan seey 
f) tot pet + om 
whose radius of convergence is clearly p. I'rom Theorem 27 (see §33) it fol- 
lows that the matrix series 


2 
HC) = B+ E+ G4 to pee 


is convergent and, in particular, the numcrical series 


is also convergent. Since this series converges, its terms do not exceed a 
certain number r which may be chosen uniformly for all pairs of numbers 
(t, 7), so that the bound (12) holds. 
(E) Let 
x = A(t)x (14) 


be the vector form of the matrix equation (1), and let C be the fundamental 
matrix of some solution #(t) of (1). An eigenvalue \ of multiplicity k 
of the matrix C is called a characteristic number of multiplicity k of (1) 
and (14). Since, up to a linear transformation, the matrix C does not de- 
pend on a random sclection of the solution ®(é) of (1) [see (A)], the char- 
acteristic numbers of (14) and their multiplicities are determined here in 
an invariant way. If \ is a characteristic number of multiplicity & of (14), 
then the number (1/7) In X is called the characteristic exponent of multi- 
plicity k of (14). Let us assume that all real parts of the characteristic 
exponents of (14) are smaller than some number 7; then there exists a 
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positive number # such that for any solution g(t) of (14) the bound 


le®| < Rle(O)|e” (15) 


holds fort > 0. 
We shall prove (15). Let ®(¢) be a solution of (1) with initial condition 
$(0) = E; then any solution g(t) of (14) may be written in the form 


e(t) = B(t)p(0). (16) 

This may be verified by substituting (16) into (14). Furthermore we have 
t+) = &0)C, 
et + 27) = oC, 

&(¢ + mr) = B(t)C”, (17) 


Since the elements of the matrix ®(¢,) are bounded on the interval 
0 < t, < 7, there exists a positive number o such that 


|®(t1)x| < o|x| for O0<t <r. (18) 


Since, further, all the eigenvalues of the matrix C are smaller than e’” in 
absolute value, it follows from (13) that the bound 


|C™x| < n?re7™™|x| (19) 


holds for an arbitrary vector x. Now let ¢ be any positive number; we can 
find a nonnegative integer m such that f= mr +,,0 < t, <7. By 
virtue of (16) and (17) we have 


o(t) = (mr + t1)9(0) = P(t,)C"p(0). 
Hence in accordance with (18) and (19) we obtain 
le()| < on?re™|p(0)|. 


Since for 0 < t; < 7 the number e!” is not less than a certain constant 
c > 0, the last inequality can be written in the form 


on’r vt 
—  é 


|p(0)|. 


le()| < 


This yields the bound (15). 


CHAPTER 4 
EXISTENCE THEOREMS 


Here we shall first prove the existence and uniqueness Theorems 1, 
2, and 3 which were formulated earlier. Further study is given to the 
question of the dependence of a solution on the initial values and on the 
parameters, if the latter actually appear in the equations. We shall in- 
vestigate the dependence of a solution with fixed initial values on the 
parameters; then, by a quite simple technique, we shall convert the initial 
values into parameters. Thus the problem will be reduced to one of 
describing the dependence of a solution on the parameters. For both the 
case of initial values and the case of parameters, we shall prove that the 
dependence of the solution on these variables is continuous and that the 
solution is differentiable with respect to these variables. In both cases it is 
necessary to distinguish between local theorems and what we shall call 
integral theorems. In the first case, one can assert that certain properties 
of a solution hold on the time interval |t — to| < r, where r is a positive 
number which is generally speaking “small.” In the second case, it is 
assumed that for fixed values of the parameter or for fixed initial values 
there exists a solution defined on the time interval ry} < ¢ < re which 
can be “large.” It is then asserted that for initial values, or for values of 
the parameter which are very nearly fixed, a solution exists on the entire 
interval r; < t < re which is continuous for all values of these variables 
or differentiable with respect to all the variables. 

In addition to this body of material, we include in the present chapter 
a section on the first integrals of a system of ordinary differential equations 
and a study of a linear partial differential equation which is closely related 
to the concept of a first integral. The results of this section will not be 
used further in this book. 


20. Proof of the existence and uniqueness theorem for one equation. 
In this section we shall give a proof of Theorem 1, the theorem of existence 
and uniqueness formulated in §1, for the first-order equation 


&= ftt, x), (1) 


whose right-hand side is defined and continuous, together with the partial 

derivative df/dz, in a certain domain T of the tx-plane P. A more complex 

and general form of the proof of Theorem 1 is found in the proof of Theorem 

2 in the following section. In presenting the proof first for the case of one 

equation, it is our aim to bring out the basic ideas of this type of proof, 
150 
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which in the more general application is encumbered by details of a 
secondary character. We shall prove Theorems 1 and 2 by the method of 
successive approximations developed by Picard and applied in analysis 
for the proofs of many existence theorems. This method can also be used 
for the approximate calculation of a solution and is therefore of great 
practical value. In certain cases the method of successive approximations 
can be interpreted as a method of contraction mappings. The proof will 
be carried out with such an interpretation in order to show the relation 
between these two methods. The difference between these two methods 
will be brought out in the proof of Theorem 3. 

Basic ideas of the proof. The first step in proving Theorem 1 by the 
method of successive approximations is the transition from the differential 
equation to an integral equation, which we shall formulate as a separate 
proposition. 

(A) Let + = g(t) be a certain solution of equation (1) defined on the 
interval r; < & < re, so that the identity 


a(t) = f(t, oD), (2) 
is satisfied, and let 
e(to) = Xo (3) 


be a certain initial condition which this solution satisfies. We then find 
that for the function ¢(¢) the integral identity 


g(t) = Xo +f f(r, e(r)) dr (4) 


is satisfied on the entire interval r; < ¢ < ro. Conversely, if a continuous 
function g(t) satisfies the identity (4) on the interval ry; < t < ro, then 
the differentiable function + = ¢(t) is a solution of the equation (1) 
and satisfies the initial condition (3). In other words, the integral equa- 
tion (4) is equivalent to the differential equation (2), together with the 
initial condition (3). 

We shall prove this. Let us assume first that the relation (4) is satisfied. 
If we replace the variable ¢ by its value to, we obtain y(to) = 20, so that 
(3) follows from (4). Moreover, the right-hand side of (4) is obviously 
differentiable with respect to t, whence the left-hand side is also differ- 
entiable with respect to ¢. By differentiating (4) we obtain the identity (2). 

Let us now assume that the relations (2) and (3) are satisfied. By in- 
tegrating (2) from ty to ¢ we obtain 


ett) — ¢(to) -| f(r, e(r)) dr. 
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We then obtain equation (4) from the last equality by virtue of (3). 
Thus proposition (A) is proved. 

We shall now introduce a certain notation which we shall use below 
in the proof of Theorem 1. 

(B) Let x = ¢(t) be a continuous function defined on some interval 
ry <#< re such that its graph is contained completely in IT, and let to 
be a certain point of the interval r; < t < rg. Then, if we use the right- 
hand side of (4), we can set up a correspondence between the function 
g(t) and the function ¢*(é), which is also defined on ry < t < ro, by 
means of the equality 


e*(t) = Xo + f(r, e(r)) dr. (5) 


(The graph of the function ¢*(é), of course, need not pass through I.) 
Thus the right-hand side of (4) can be regarded as an operator establishing 
the correspondence between the functions ¢* and ¢. Denoting this operator 
by the single letter A, we write (5) in the form 


e* = Ag. (6) 
By using the operator A, the integral equation (4) can be written in the 
form 

e = Ag. (7) 


(C) Let v(t) be a certain continuous function defined on the interval 
ry <t <r. The maximum modulus of this function, 


lll = max le(h 

will be called its norm ||y||. If ¥@) and x(¢) are two continuous functions 
defined on the interval r; < ¢ < re, then the norm ||\y — x|| of their 
difference y(t) — x(é) is a nonnegative number which expresses the extent 
to which these functions differ: if the number || — x|| is small, then the 
functions y and X are “close” to each other. The equality ||¥ — x|| = 0 
is valid if and only if y and x coincide identically. The notion of uniform 
convergence, which is familiar from a course in analysis, may be formulated 
easily in terms of the norm. Let 


golf), 1(t),..., o(t),... (8) 


be a sequence of continuous functions defined on the interval r; < t < re. 
The sequence (8) converges uniformly to a function ¢, defined on the same 
interval ry; < t < ro, if 

lim |lg — ¢;|| = 0. 


ta 
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In order that the sequence (8) converge uniformly, it is sufficient that the 
inequalities 
leiga — gill < a: 


be valid, where the numbers ao, @),..., @:,... form a convergent series. 

Before proving Theorem 1 in detail, we shall outline briefly the method 
of successive approximations, which is applicable to the solution of equa- 
tion (7). We form the sequence 


PO, Ply - ++) Pipe es (9) 


of continuous functions which are defined on a certain interval ry; < ¢ < ro 
containing the point to. Every function of the sequence (9) is determined 
by the preceding one by means of the formula 


i441 = Agi, a = 0, 1, 2, tee (10) 


If the graph of the function y; passes through T, then y;+, is defined by 
(10), but in order to define the next function ¢;42 it is necessary that’ the 
graph of ¢;41 pass through ©. This, as we shall show, can be achieved by 
selecting a sufficiently short interval r; < ¢ < re. Furthermore, by de- 
creasing the length of the interval r; < ¢ < re, we may also show that 
the elements of the sequence (9) satisfy the inequalities 


leita _ gill < klles —_ ¢i—i|l, t= 1, 2, sey (11) 
where 0 < k < 1. From (11) follow the inequalities 
leer. — vill < lle: — goll-k’, += 1,2,..., 


so that the sequence (9) is uniformly convergent [see (C)]. Furthermore, 
it is also easily established that the limit y of (9) satisfies equation (7). 

The same construction can be described in a somewhat different way 
by the method of contraction mappings. We shall select a certain family 
© of functions defined on the interval r; < & < rg (where ry < to < ro) 
so that the graphs of these functions pass through the domain [f. We 
shall assume, in addition, that with respect to the operator A the family Q 
satisfies the following two conditions: (a) whenever A is applied to any 
function of Q, we again obtain a function of 0; (b) there exists a number k, 
0 < k < 1, such that for two arbitrary functions y and x of Q the in- 
equality 

Ay — Ax|| < klly — x| 


is satisfied. In this sense the “mapping” A is a contraction mapping 
(it would be more correct to say “contracting”). 
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0 


Ficure 44 


It is easy to see that, if the conditions formulated are satisfied for the 
family Q, then, by starting from an arbitrary function go of 2, we obtain 
by the induction formula (10) an infinite sequence (9) satisfying (11) and, 
as was noted above, converging uniformly to a solution ¢ of (7). 

We shall now turn to the proof of Theorem 1 on the basis of these 
remarks. 

Proof of Theorem 1. The initial values tg and 29 of the solution of equa- 
tion (1) to be found are coordinates of the point (fo, 9) in T. First of all, 
we choose any rectangle II with its center at (fo, Zo), its sides parallel to 
the axes, and with its boundary entirely contained in I (Fig. 44). We 
shall denote the length of a horizontal side of II (parallel to the ¢ axis) 
by 2g and the length of a vertical side by 2a. Thus the point (£, x) be- 
longs to II if and only if the equalities 


lé—tl Sq |x — al Sa (12) 
are satisfied. Since II is a closed set which is contained in I, the continu- 
ous functions f(t, rz) and df(t, x)/dx are bounded on I and therefore 
two positive numbers M and K exist such that for any ¢ and 2 satisfying 


(12), the inequalities 


<k (13) 


1 <M, wea) 


are valid. 
Along with the rectangle II we shall consider a “narrower” rectangle II,, 
defined by the inequalities 


Jf— to] S7,  & — Xo] Sg, 


where r<@q (14) 
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(see Fig. 44). We shall determine the number r more precisely. We denote 
by Q, the family of all continuous functions defined on the interval 
| — to| < r whose graphs pass through II,. Thus the function y, which 
is defined on | — to| < 7, belongs to Q, if and only if the inequality 


le(t) — xol S a (15) 


is satisfied for any ¢ belonging to this interval. 

We shall now endeavor to select the number r in such a way that the 
following two conditions will be satisfied: 

(a) If the function ¢ belongs to Q,, then the function g* = Ag [see 
(5) and (6)] also belongs to Q,. 

(b) There exists a number k, 0 < k <1, such that the inequality 


|Ay — Axl] < kllp — xl| (16) 


is valid for any two functions y and x of Q,. 
Let us consider the condition (a). In order that 9* = Ag belong to Q,, 
it is necessary and sufficient that the inequality 


le*(t) — ro] Sa 


be satisfied for |£ — t | < r. By virtue of (5) and (13) we have 


le*(t) ~~ Xo| = | f(r, g(r) dr < Mr, 


from which it is evident that condition (a) is satisfied for 


r<u- (17) 


Let us now consider condition (b). We have 


v*(t) = zo + / f(r, W(r)) dr, 


x*(t) = ro + / f(r, x(r)) dr. 


Subtracting the second equality from the first, we obtain 


lyr) — x*(t)| = | / [f(r, w(r)) — f(r, x(r))] ar 


if f(r, Wr) — F(z, x(z))| dr} - (18) 


lA 
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Let us now estimate the size of the last integrand by means of Lagrange’s 
formula and the second of the inequalities (13): 


ie, wer)) — Hl, x@@))| = [XS (W — x) < K- WO) — XI; 
(19) 


here 6 is a number between ¥(7) and X(7r) and, consequently, it satisfies 
the inequality |@ — x9| < a. From (18) and (19) we have 


Ay — Ax|| = |l¥* — x*I| < Krllp — xl]. 


Thus (b) is satisfied if the number k = Kr is less than one, i.e., if 


(20) 


ml 


r< 


Thus, if r satisfies (14), (17), and (20), then the conditions (a) and (b) are 
satisfied for the family 0,. In what follows we shall assume that r has 
been chosen in such a way that (14,) (17), and (20) are satisfied. 

We shall now construct a sequence 


PO) Ply +++ Pipe ss (21) 


of functions defined on the interval {tf — éo| < r by setting 


Il 


voll) = to, (22) 
ein1 = Av, 1=0,1,2.... (23) 


Since the function (22) belongs to the family Q,, all functions of (21) also 
belong to the same family [see condition (a)]. In addition, we have 


ler — goll = , max le1 — %| <a 
|t—tolSr 


[see (15)}. By virtue of (16) we obtain 
ler41 — gil] = |Agi — Agi-all < & |les — ec-rl, 


whence 
ler44 _ ell < ak’, = 0, 1, 2, cee 


Thus by (C) the sequence (21) converges uniformly on the interval 
|t — to| <r to a continuous function ¢. Since all functions of (21) 
belong to Q,, the function ¢ also belongs to Q, [see (15)]. We now show that 
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the function ¢ satisfies (7). To do this, we note that the sequence 
Ago, Ag1,..., A@i..-- 
converges uniformly to the function Ag; actually, we have 
|Ay — Agill < klle — gill 
Taking the limit as 7 — oo in relation (23) we obtain 
yg = Ag. 


Thus, we have proved that a solution x = g(t) of equation (1) exists which 
satisfies the initial condition (3); moreover, we have established that the solution 
x = g(t) is defined on the interval |t — to| <r, where r is an arbitrary 
number which satisfies (14), (17), and (20). 

We now proceed to the proof of uniqueness. Let x = y(t) and x = x(t) 
be two solutions of equation (1) with common initial values to, x9, and 
let ry < t < req be the interval which is the intersection of the intervals 
of existence of the solutions y and X; it is clear that r) < to < re. Let us 
denote by N the set of all such points of the intervalr,; < ¢ < rg at which 
the solutions y and X coincide. The set N is nonempty since it contains 
the point to. We shall show that the set N is open and that it is closed 
on the interval ry < t < ro. 

We shall prove first that the set N is open. Let ¢, be an arbitrary point 
of N; since the solutions ¥(f) and x(é) coincide at this point, that is, since 
W(t) = X(t,) = x1, the point (¢;,2,) can be regarded as the common 
initial value for both solutions. In this sense (¢,, x) is no different from 
(tp, Zo), and therefore we shall preserve for (t,, x,) the notation (to, zo); 
this will allow us to keep the previous notation. Going over from the 
differential equation (1) to the integral equation (4), we obtain for both 
y(t) and X(t) integral equalities which in terms of operators can be written 
in the form 

y = Ay, x= AX. (24) 


As before, we now choose first a rectangle II in I with its center at the 
point (to, xo), and then a rectangle II, in such a way that the number r 
will satisfy still another condition in addition to the inequalities (14), 
(17), and (20): namely that the functions y and xX are defined for 
| — to| < rand satisfy the inequalities 


l(t) — rol < a, x(t) — aol < a; 


this is possible since y(t) and X(t) are continuous. Then on the interval 
|t — to| <r the functions y(t) and x(t) belong to the family 0,, and 
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consequently, by (16) and (24), we obtain 
lv — x|| = ||Av — Axl] < Ally — xl, 


which is possible only when ||y — x|| = 0, i.e., when y and X coincide on 
the interval |t — to| <r. Consequently, we have established that for 
every point to of N, the set N also contains some interval |t — fo| <r 
with its center at to, so that the set N is open. 

We shall now prove that the set N is closed on the interval r; < t < ro. 
This means that we must show that if the sequence {, tz,...,t,... of 
points of N converges to some point # of the interval r; < t < re, then ¢ 
also belongs to N. We have 


v(t) = X(t), t= 1,2,.... 


Since y(t) and x(¢) are defined and continuous at 7, we may pass to the 
limit as 7 — o to obtain ¥(@) = x(é), so that the point # also belongs to 
the set N. 

In proposition (D) below it will be shown that since the nonempty set N 
is both closed and open on a certain interval, it must coincide with this 
interval. Thus we shall have established that the solutions y(é) and X(t) 
coincide on the entire interval r; < ¢ < ro. Theorem 1 will thus be 
proved. 

(D) Let ry < ¢ < reg be any interval of real numbers and N a non- 
empty subsct which is both open and closed on the interval r; < ¢ < 19; 
then the set N coincides with the interval r} < t < 19. 

Since the set N is nonempty, it contains at least one point ¢). Let us 
assume that there exists on the interval r; < ¢ < re a point ¢t, which 
does not belong to N. To be definite, we assume that t; > tp. We denote 
by NV the set of all points ¢ of the set N which satisfy the inequality t < t, 
and let — be the least upper bound of the set N,. Since N, is obviously 
closed on the interval r; < ! < rg, the point @ belongs to N,; and there- 
fore cannot coincide with t,, so that — < t;. But since N is open, it must 
contain an entire interval with the point / and therefore ~ cannot be the 
least upper bound of the set N,. This contradiction proves proposition (D). 


EXAMPLE 


Let us find a solution by the method of successive approximations for 


the simple equation 
t= 2. 


We shall seck the solution with initial values 


to = 0, r= 1. 
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The corresponding integral equation can be written in the form 


g(t) = 1 +f (Tr) dr. 


If we form the sequence 
20) Ply s+ Pipe ery 


we have 


vote) = 1, 


oo —14fara144 
, 
oat = 14 +ndr=1+t+s@, 
t 
ext = 1 +f Gtrtge)ersititgerae, 


ol) =1ttt GO + eet tae 


The limit of this sequence (which is uniformly convergent on any interval 
on the number axis) is the function g(t) = e’. 


21. Proof of the existence and uniqueness theorem for a normal system 
of equations. Here we shall prove the theorem of existence and uniqueness, 
Theorem 2 which was formulated in §3 for a normal system of equations 


z = fi(t,c',2?,...,2”), tZ=1,...,7; (1) 


the right-hand sides fit, zi, x?, ...,2") of (1), together with the partial 
derivatives df*(t,z!,..., x")/dx’, 7,7 =1,..., n, are defined and contin- 
uous in a domain I of the space of the variables ¢, x'!,..., 2". If we set 


x = (x!,...,2”), (2) 

f(t, x) = (f'(t,x), P(t, x), .--,f"(,%)), 

we may rewrite (1) in the vector form [compare §14, (A)] 
x = f(t, x). (3) 


We shall carry out the proof in vector form by the method of successive ap- 
proximations; this will be an almost verbatim repetition of the proof of 
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Theorem 1] given in the preceding section. In addition to the proof of 
Theorem 2, we shall also present here the proof of Theorem 3 by a some- 
what modified form of the method of successive approximations. 
Auriliary propositions. In order that we may use vector notation freely, 
we shall establish first of all certain natural definitions and simple in- 
equalities for vectors and vector functions. 
The length or modulus |x| of the vector (2), as we know, is defined by the 


formula 
Ix] = + -V(r1)2 +--+ + (27)2. 
It is an easily proved fact that if x and y are two vectors, then the inequality 
Ix + y| < |x| + ly| 


is valid. From this inequality follows a similar inequality for an arbitrary 


number of vectors X;,..., Xz, namely, 
Ixy tee) +X, < [xi] +--+ + |xd. (4) 
Let g(t) = (¢1(),..., e*(t)) be a continuous vector function of a real 


variable f, i.e., a vector whose coordinates are continuous functions of ¢. 
If the function g(¢) is defined on the interval ry < ¢ < re, then for 
ry < tg < rg on the same interval we can define the vector function 


t 
y(t) = / y(t) dr, 


whose components y1(t),..., ¥”(t) are defined by the formulas 


y(t) = / g(r) dr; 


t t 
| g(r) dr] < | lp(r)| dr 


is valid. To prove this inequality we shall divide the interval of integra- 
tion into m equal parts by setting 


here the inequality 


(5) 


A ; t, = to + kA, k= 1,...,m, 


where the number A will be positive for ¢ > to and negative for t < ft. 
Then, by the definition of the integral of a vector function and by virtue 


21] PROOF FOR A NORMAL SYSTEM OF EQUATIONS 161 


of (4), we have 


/ . p(T) dr 


lim > ote < lim > le(tx)| + [Al 
Ma k=1 


MoO k=1 


= i le(r)| dr} - 


We shall establish an inequality for the vector function 


g(x) = (gi(x!, srry x”), vey g(x", srry ")) 
of the vector variable x defined on a convex set A of the space of variables 
z!,...,2". We shall assume that the inequalities 
i(yl 
ears pee y t") <K 
Oxi 


are valid, where K is a positive number. Then for any two points x and y 
of the set. A the inequalities 


le(x) — g(y)| < n?K|x — y| (6) 
are valid. 
To prove inequality (6) we introduce the notion of the segment joining 
x and y, namely, we set 


z(s) = y + s(x — y). 


Whenever s runs over all values of 0 < s < 1, then z(s) takes all values 
of the segment joining the points x and y, and always remains in this seg- 
ment because of the convexity of the set A. We obtain (by applying the 
Lagrange formula) 
i i 5 i dg’ (z(s 
g(a) — g'ty) = o'(2)) — g'(e(o)) = 22) 
s=0 

By calculating dg'(z(s))/ds by the formula for the derivative of a com- 
posite function, we obtain 


dg’(z(s)) _ dg'(z*(s),.-.,2"(s)) _ >> ag'(z1(s),...,2"(s))  de*(s) 
ds ds - axk ds 


k 
—¥"); 


= yy Aa) 8) (gt 
k=1 


Oxk 


and therefore 


lo) — gy) < DO KR" — y'| < DO Kix — y| < nK[x — yl. 
k=1 k=1 
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By squaring the last inequality, summing over 7, and extracting the root, 
we obtain 
le(x) — g(y)| < n°?K|x — y| < n°K|[x — yl. 


We proceed from the differential equation (1) to an integral equation in 
the same way as in the proof of Theorem 1. 
(A) Let x = g(t) be a certain solution of the differential equation (3), 
so that the identity 
o(t) = f(t, et) (7) 


is valid, and let 
(to) = Xo (8) 


be the initial condition which this solution satisfies. 
We shall now prove that the pair of relations (7) and (8) is equivalent 
to one relation, 


t 
p(t) = Xo ~| f(r, (r)) dr. (9) 


Assuming that the integral identity (9) is satisfied and substituting t = to 
into it, we obtain (8), which, when differentiated with respect to t, yields 
(7). We shall now assume that (7) and (8) are satisfied. Integrating (7) 
from to to ¢ and taking into account (8), we obtain (9). 

(B) By using the right-hand side of (9), we can make every vector 
function g(t) whose graph passes through domain I correspond with the 
function y*(é) by setting 


t 
e*(t) = Xo + [. f(r, o(r)) dr. (10) 
We may write the same relation briefly in the operator form 
o* = Ag. (11) 
Equation (9) may now be written 
yg = Ag. (12) 


(C) Let g(t) be a continuous vector function defined on the interval 
71 <t < re. We define the norm ||g|| of this function by setting 


= max t)}. 
llol| ox le(t)| 


By using the concept of the norm, we may formulate the condition of 
uniform convergence of the sequence 


PO: Pir +--+) Pye es (13) 
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Figure 45 


of vector functions defined on the interval r; < t < re. The sequence 
(13) of vector functions converges uniformly to a vector function ¢ de- 
fined on the same interval r; < t < ro, if 

lim || — ¢il| = 0. 

ta 
In order that the sequence (13) be uniformly convergent, it is sufficient 
that the inequalities 

lloeqa —_ ¢:l Sa 


be satisfied, where the numbers do, a1, ..., a:, ... form a convergent series. 
We now proceed to the proof of Theorem 2. 
Proof of Theorem 2. Since the point (to, Xo) = (to, x0, 2, ... , 2%) be- 


longs to I, there exist positive numbers g and a such that all points (¢, x), 
which satisfy the conditions 


lt ~~ to| < q |x _ Xo| < a, (14) 


are in I. Since the set II consisting of all points (¢, x) satisfying (14) is 
closed and bounded (T'ig. 45), the continuous functions 


af'(t, © 


ai? Zj=l,...,n, 


f(t, x)| and | 


are bounded on ZT, i.e., there exist positive numbers M and K such that 


If(t, x)| <M, pace 


su | SK 4G =1---40 (15) 


on the set II. 
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Along with II we consider its subset I], which is defined by the in- 
equalities 
lt — tol <r, |x — Xo| < a, 


where 
r<q (16) 


(Tig. 45). We shall denote by Q, the family of all continuous vector func- 
tions defined on the interval | — to| < r whose graphs pass through II,. 
Thus the function g, defined on the interval |t — t)| < r, belongs to the 
family Q, if and only if the inequality 


lp(t) — Xo] S a (17) 


is satisfied for any ¢ on this interval. 

We now choose r in such a way that the following two conditions will 
be satisfied: 

(a) If the function g belongs to the family 0,, then the function g* =Ag 
[see (10) and (11)] also belongs to Q,. 

(b) There exists a number & on the interval 0 < k < 1 such that for 
any two function y and X of the family ©, the inequality 


|AY — Ax]| < kily — xl (18) 


is valid. 

Let us consider condition (a). In order that the function g* = Ag 
belong to 9,, it is necessary and sufficient that for |£ — to| < r the in- 
equality 

le*(t) — Xo] S a 


be satisfied. By virtue of (10), (5), and (15), we have 


¢ t 
lp*(t) — Xo| = | f(r, p(r)) dr| < / lf(7, e(r))| dr] < Mr. 
0 0 
Irom this it is evident that condition (a) is satisfied for 
r< x (19) 


Let us now consider condition (b). We have 


W*(t) — x*O| = | (£(7, ¥(r)) — £(7, x(7))) dr 


< if [f(r, ¥(r)) — f(r, x(r))| dr} - (20) 
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We shall now find an estimate for the last integrand by using inequalities 
(6) and (15): 


lf(r, ¥(r)) — f(r, x(r))| < n?K|y(r) — x(r)]. (21) 
From (20) and (21) it follows that 
Ay — Ax] = |lv* — x*l| < n?Krlly — x]. 


Thus condition (b) is satisfied if 
k 


r < nek’ (22) 


where k < 1. 

Thus if the number r satisfies the inequalities (16), (19), and (22) 
(which we shall henceforth assume to be satisfied), then the conditions 
(a) and (b) are satisfied for the family Q,. 

We now form the sequence of vector functions 


Polt) =X, gilt),..-,eilt),..-, (23) 
defined on the interval |! — to] < r by setting 
i411 = Ag,, i= 0, l, cee (24) 


Since the function go belongs to the family Q,, all functions of (23) belong 
to the same family (see condition (a)]. Further, we have [see (17)] 


lor — goll = max, |gilt) — Xo| < a. 
|t—tg| Sr 
Irom (18) we obtain 


ler _ ml = | Ag: —_ Ag:_1|| < klle: _ %:-1||; 


whence 
llor41 — ill < ak’. (25) 


Thus by virtue of (C), the sequence (23) converges uniformly to a con- 
tinuous function ¢ belonging to 2,.. We shall show that the function ¢ 
satisfies equation (12). For this we observe that the sequence 


Ago, Ag, sey Ag, se 
converges uniformly to the function Ag; actually, we have [see (18)] 


|Ag — Agill < klle — gill. 
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If we allow 7 — oo in relation (24), we obtain 
g = Ag. 


Hence we have proved the existence of a solution x = g(t) of equation (3) 
satisfying the initial condition (8); morcover, the solution x = g(t) is defined 
on the interval |t — to| < 1, where r is an arbitrary number satisfying (16), 
(19), and (22). 

We now proceed to the proof of uniqueness. Let x = y(é) and x = X(t) 
be two solutions of equation (3) with the common initial values to, Xo, 
and let r; < t < rg be the intersection of the intervals of existence of 
the solutions ¥ and X; it is obvious that r} < t9 < ry. We shall denote 
by N the set of all points of the interval ry < £ < rg at which the solu- 
tions y and X coincide. The set N is nonempty since it contains the 
point fo. We shall show that N is open and that it is closed on the interval 
r; <t <r. From this it will follow, by proposition (D) of §20, that NV 
coincides with the interval r; < ¢ < re, that is, the solutions y and x 
coincide identically on this entire interval. 

First we shall prove that the set N is open. Let ¢, be an arbitrary point 
of N; since the solutions y(t) and X(£) coincide at this point, so that y(¢;) = 
X(¢,) = x1, then (¢,, X,) can be taken as common initial values for both 
solutions. In this sense, the point (¢;, x,) does not differ from the point 
(to, Xo), and therefore we shall use for (¢,,x,) the designation (to, Xo), 
since this will permit us to retain the previous notation. Going over from 
the differential equation (8) to the integral equation (9), we obtain for 
both ¥(@) and X(#) integral equalities, which, in operator form, can be written 


y= Ay, X= AX. (26) 


We choose again the set II which is contained in the domain I with its 
center at the point (fo, Xo) [see (14)]; we then choose the set II, in such a 
way that the number r, in addition to satisfying inequalities (16), (19), 
and (22), also satisfies the condition that the functions y and x are defined 
for | — to| < rand satisfy the inequalities 


IA 


W(t) — Xo] 
|x(t) — Xo 


a 


y 


lA 


a. 


This is possible, since y(t) and x(£) are continuous. Then W(t) and x(é), 
|t — é| <r, are contained in the family Q, so that by (18) and (26) we 
obtain 


ly — x]] = ||Av — Axil < kl — x], 
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which is possible only if ||¥ — x|| = 0, that is, if y and x coincide on the 
interval |t — fo| < r. Hence, whenever fo is contained in N, the set N 
will also contain some interval |t — to| < 7 with its center at to, so that 
N is open. 

We shall now prove that the set N is closed on the interval r; < t < re. 
To do this we must show that if the sequence fy, to,..., ¢;,... of points 
of N converges to a point £ of the interval ry < ¢ < re, then # must also 
belong to VN. We have 


v(t.) = X(t), 7=1,2,.... 


Since the functions y(t) and x(¢) are defined and continuous at ¢, then by 
passing to the limit as 1 — o, we obtain ¥(f) = x(é), so that f also 
belongs to N. Thus Theorem 2 is proved. 

We shall now state as a separate proposition certain facts established in 
the proof of Theorem 2 which will be needed later. 

(D) Let us assume that the right-hand sides of the system (1) [or, in 
vector form, of equation (3)] are defined and continuous, together with 
their partial derivatives af*/dz’, in T. Let (to, Xo) be a point of I, and let 
q and a be positive numbers such that the set II consisting of all points 
satisfying (14) is contained in IT. Furthermore, let M@ and K be positive 
numbers such that the inequalities (15) are satisfied for all points (é, x) 
satisfying (14). Finally, let r be any positive number which satisfies (16), 
(19), and (22). Then the solution of equation (3) with the initial values 
(to, Xo) is defined on the interval |f — to| <r. In addition, the solu- 
tion may be obtained on the interval |f — fo| < ras the limit of the 
sequence (23) which is defined inductively by (24), since inequality (25) 
is satisfied for these functions. 

Proof of Theorem 3. We shall proceed to the proof of Theorem 3, which 
asserts that for the normal linear system 


= s ai(t)e? + b(t) = fit, c',...,2"), t=1,...,n, (27) 


j=1 


whose coefficients a,(t) and free terms b‘(t) are defined and continuous on 
the interval gq; < t < qe, there exists a solution with the arbitrary initial 
values 


to, 2, cre , XO, qd < to < q2, (28) 


defined on the entire interval gq; < t < qa. We shall show that the same 
operator A [see (10) and (11)], which was applied in the proof of Theorem 2 
but is constructed here by using the right-hand sides of system (27), 
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gives rise to a sequence of vector functions 


Polt) = Xo, gilt),---, elt), .--; (29) 


which not only converges on the entire interval q, < ¢ < qe, but also 
converges uniformly on any segment contained in that interval. To 
carry out the method of successive approximations we shall need to find 
a more precise estimate for the numbers ||g;,1 — ¢;||, where « = 0, 1, 
2,.... In this case it can be seen that the method of successive approxima- 
tions does not fit into the frame of the method of contraction mappings. 

Let A be the operator defined by relations (10) and (11), stemming from 
the system (27) with the initial values (28). Obviously, we shall apply 
the operator A to an arbitrary continuous function g(t) defined on the 
interval g,; < t < qe. By proposition (A) the system (27) with the initial 
conditions (28) is equivalent to the operator equation 


ge = Ag, (30) 


for which we shall find a solution defined on the entire interval 
qi <t <q. The functions of the sequence (29), which is -defined 
inductively by the relation 


Fin = Agi, = 0, 1, 2, tty (31) 


are defined on the interval gq, < t < qo. 
Let 7; < t < re be an arbitrary interval which contains the point to 
and which is still contained in the interval gq; < t < q2 so that 


mam <1 <to <2 < Qo. 


We shall] show that the sequence (29) converges uniformly on the interval 
ry <& < re to the solution of (30). For the right-hand sides of equations 
(27) we have : 

of i 

agi = Ui), 


and therefore for r,; < & < rg the inequalities 


af 


sa] SK 8G =1,-..57, 


are valid, where K is some positive number. Since the function ;(é) is 
bounded on the interval 7; < ¢ < re, the inequality 


lei() — go(t)| SC 


holds over this interval, where C is some constant. Furthermore, on this 
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interval we obtain from (5) and (6) the following relations: 


lot) — 1(0)| 


| (eG, ex) ~ £6, got] ar 


IA 


< n?KC\t — tol; 


| |£(r, 1(r)) ~~ f(r, ¢0(7))| dr 


I 


lps(t) — e2(t)| | [£(r, 2(T)) _ f(r, ¢1(r))] dr 


t 2 pry 2 
< [ It(r, go(r)) — £(r, ex(r))| dr] < PPI Ye — sg? 


locas ~ ol = | f [86,016 — #07, e-en)] at 


2ar\i 
< whe 
a! 


< if |f(r, ei(r)) — £(7, gi—a(z))| ar lt — tol’. 


Hence we have 


llpia1 — gall < C (WK — ri). 


! 


Since the numbers C(n?K(r2 — 71))'/i! form a convergent series, the 
sequence (29) converges uniformly on the interval r; < ¢ < r2 to a cer- 
tain continuous function g(t). Tor this function we have 


| iter, er) — £64, ofm)\ ar 


< n’K(r2 — rile: — ll; 


Ag; — Ag|| < max 
] Pr ell ~ ry StSre 


so that the sequence of functions 
Ago, A¢1, cee Agi, tee 


converges uniformly to the function Ag on the interval ry; < t < ro. 
Passing to the limit in (31) we obtain 


yg = Ag. 


Since 7} < ¢t < rg is an arbitrary interval containing the point to and 
contained in the interval gq; < t < ge, the sequence (29) converges at 
every point of the interval gq; < ¢ < qe, so that the function g(t) is defined 
on the entire interval g, < t < qe and is a solution of equation (30) on 
the same interval. Thus Theorem 3 is proved. 
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22. Local theorems of continuity and differentiability of solutions. We 
shall consider here a normal system of differential equations whose right- 
hand sides depend on certain parameters, and we shall prove that under 
certain assumptions, a solution of this system with fixed initial values is a 
continuous and differentiable function of these parameters, while in the 
case of variable initial values it is a continuous and differentiable function 
of the initial values. The normal system under consideration will be 
written in the form 


i i l . 
Fo fi(t,ct,...,ami,...,H), t= 1,...,n, (1) 


where y',...,u! are certain numerical parameters. In what follows we 
shall assume that the right-hand sides 


fiGz,..., 2% wl... w), t= 1,...,2n, (2) 


of these equations, as well as their partial derivatives 


ify el not l 
fit, zi,..., 2", po. pw) — OF ey Why) Soe posh), 
Zj=1,...,% (3) 
are defined and continuous in some domain FI of the space of the variables 
t,z',...,a",p!,...,u'. The system (1) may be rewritten in the vector 
form 
x = f(t, x,m); (4) 
here 


x = (2',...,2"), w= (u',..., n°), f= (f',...,f”) 


are vectors belonging to vector spaces of dimensions n, lJ, n, respectively. 

Continuous dependence of solutions on parameters. We shall prove 
Theorem 13 below on the continuous dependence of a solution on param- 
eters by a straightforward repetition of the proof of Theorem 2, in which 
we shall use only those steps of that proof which are mentioned in proposi- 
tion (D) of §21. 


THEorEM 13. If (to, Xo, @o) is an arbitrary point of the domain T, 
there exist positive numbers r and p such that for 


lz — mol < p 
the solution 


x = o(t, m) 
of equation (4), which satisfies the initial condition 


elo, Mt) = Xo, (5) 
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is defined on the interval [é — to| < r and is a continuous function of 
all the variables ¢, u',..., 4! on which it depends. 


Proof. First we shall select positive numbers gq, a, p such that the set 
II of all points (¢, x, #) which satisfy the inequalities 


lt ~~ tol < q [x —_ Xol < a, u ~_ Hol < p (6) 


belongs to the domain T. Since the functions (2) and (3) are continuous 
in [ and the set II is closed, there exist positive numbers M and K such 
that for every point (¢, x, w) of II the inequalities 

fGxml <M, \ilgx,e)| <K 


are valid. We now choose a positive number r which satisfies the in- 
equalities 
k 


r<q ro < 


[see (16), (19), and (22) of §21 and (D) of §21). 

For every fixed value » which satisfies the last inequality of (6), we 
replace the differential equation (4), together with initial condition (5), 
by the integral equation 


t 
v(t, m) = Xo +/ f(r, o(7, mw), m) dr (7) 


[see §21, (A)]. Using the right-hand side of this integral equation, we shall 
define an operator setting up a correspondence between the function 
v(t, #) and the function 


é 
e*(t, m) = Xo + f(r, (7, m), m) dr, 


or, briefly, 
*— A 
? u- 


We shall also define a family Q, of vector functions g(t, #) of the variables 
t, » defined on the set |t — tol <r, |w# — mo| < p, assuming that o(é, ») 
belongs to Q, if the inequality 


lp(t, #4) — Xo| Sa 


is satisfied for |t — to| <r, jw — mol < p. In proposition (D) of §21 it 
was noted that the sequence of functions 


Pott, #) = Xo, ort, #), rs) vilt, BH), sey (8) 
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defined by the inductive relation 


i441 =F A,g; 


for every fixed m, where | — pol < p, is defined on the interval 
|t — to| < r and converges uniformly to the solution g(t, w) of equation 
(7), where the inequalities 


lei41 — ill < ak? 


are satisfied. These inequalities are satisfied for any values of t, u!,... , 2! 
which satisfy the inequalities |£ — to] <r, |u# — mo| < p. From these 
inequalities it follows that the sequence (8) of continuous functions de- 
pending on ¢t, w',...,u! converges uniformly in these variables, so that 
its limit g(t, #) is a continuous function of the variables t, u!,..., ul. 
Thus Theorem 13 is proved. 

Differentiability of the solutions with respect to the parameters. We shall 
now proceed to examine the differentiability of the solution g(t, m) of 
equation (4) with respect to the parameters u!,..., u/. Asa preliminary, 
we shall prove an auxiliary proposition (A), usually called Hadamard’s 
lemma. 

(A) Let g(t',...,¢?, ul,...,u%) be a function of p+ q variables 
defined in the domain A of the space of these variables, which is convex 
with respect to the variables u!,..., u%. If we set 


t= (,...,0), u=(ul,...,u%, 


we may write it as a function g(t, u) of two vectors. We shall assume that 
in the entire domain of definition g(t,u) and its partial derivatives 
dg(t, u)/du’, 7 = 1,..., 9, are continuous. Then for any pair of points 
(t, u,), (t, w2) in A with the same coordinate t, the relation 


q . : 
g(t, 2) — g(t, ui) = >) Aj(t, wi, w2)(u — uf) (9) 
j=l 
is valid, where the functions h,(t, u,, U2), 7 = 1,...,¢, are defined and 
continuous for all values of the arguments t, u,, u, (and, in particular, for 


uy = Uy). 
To prove (A) we set 
w(s) = ui + sug — uy), OX s <1, (10) 


so that we have 


g(t, u2) — g(t, ui) = g(t, w(1)) — g(t, w)) = [ = g(t, w(s)) ds. 
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Let us now find the derivative (8/ds)g(t, w(s)); we have 


owi Os 


2 a(t, w(s)) = 2 a(t, w'),..., w%(s)) = > 


j=1 


Since it is clear that (10) implies 


then, by setting 
hj(t, ui, ug) = [ do(t, w(s)) 4, 8, 


ows 


we obtain formula (9). Since by hypothesis the functions dg(t, u)/du’ 
are continuous, the functions h,(t, u,, 2) are also continuous. Thus 
proposition (A) is proved. 


TuHEoREM 14. Let the partial derivatives 
i =~ fi p= 1 k= (il 
ek Xm) = a PG Xm), t= t,..-,n, f= 1,...,4, ( ) 


of the right-hand sides of system (1) exist and be continuous in the 
domain T. Let (to, Xo, #to) be some point of T. Then there exist positive 
numbers 7’ and p’ such that for |i — to| <r’, [a — mol < p’ the solu- 
tion g(t, #) of equation (4), which satisfies the initial condition 


¥(to, #) = Xo, (12) 


has continuous partial derivatives 


dept, #) 
oh (13) 
and mixed partial derivatives 
a7 p(t, m) 
Shank’ (14) 


which are also continuous and do not depend on the order of differentia- 
tion. Furthermore, as functions of time é, the partial derivatives (13) 
satisfy the system of equations 


5 (2a) - 2 file, oC 11), n) ew) ei(t, o(t,m),m), (15) 
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with the initial condition 


de'(to m) _ 
SoH) — 9 (16) 


The system (15) is called the system of variational equations for the 
system (1). 


Proof. To simplify the unwieldy notation, we shall assume in the proof 


that & = 1, and we shall decompose the vector w = (u!,..., 4’) into the 
vector A = (u!,..., uw’) and the scalar v = w!, so that we have 
w= (A,?). 


By Theorem (13) the solution g(é, #) = g(t, ,v) is defined and con- 
tinuous for 
lt — tol <7, — |u — mol < 2; 


in the sequel we shall choose only those values of the parameters A, v 
which satisfy the second of these conditions. 

To form the preliminary difference quotient for calculating the deriva- 
tive dy'(t, A, v)/dv, we set 


(e°(t, i, V2) _ v(t, r, ¥1)); 


; 1 
v => - 
vy (i r, Vi, V2) Ve Vy 


this quotient is defined for vy; ¥ v2. Since 
v'(t, A, ¥), t= 1,...,%, 
is a solution of the system (1), we have 


aos 1 
at y (t, A, Vi, V2) —_ Vo V1 


(2 ¢' (it, A, V2) —_ : e'(t, r, 1) 
= sa (f°(t elt, A, v2), A, V2) — F(t eA, 1), A, ¥1))- (17) 


Ve 


We apply Hadamard’s lemma [see (A)] to the right-hand side of (17), 
setting 


t= (t, d), y= (v(t, i, V1), v1), Ue= (v(t, i, V2), v2), 
g(t, u) = fi(t, p(t, A, v), A, »). 
By virtue of (A) we obtain 
rt, v(t, i, V2), i, v2) —_ rt v(t, i, V1), r, v1) 


= >> Ai(t, us, u2)(9"(t, A, v2) — 97(6,A, ¥1)) + Anga(t, ui, U2)(v2 — V1), 


j=1 
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where hi(t, Uj, Ue), 7 = 1,...,%-+ 1, are continuous functions in all 
arguments, i.e., in the arguments ¢, A, 71, Ve: 


hi(t, ur, Us) = Ait, A, v1, V2). 


Thus (17) can be rewritten in the form 


Cae ON Fs vi 

a (t,.0, ¥1, ¥2) = D> AEA, V1, VaW'Ut, A, ¥1, V2) + Ing ill, A, ¥1, V2). 
j=1 
(18) 


The last relation represents a system of linear differential equations with 
respect to the functions y(t, A, v1, ¥2), 7 = 1,...,n, where the coeffi- 
cients of this system and the free terms depend (continuously) not only on 
t, but also on A, 14, Ve. 

We shall determine the nature of the initial conditions satisfied by the 
functions ¥*(t, A, ¥1, ¥2). By virtue of (12) we have 


W'(to, A, V1, ¥2) = mon (e'(to, A, v2) — e'(to, A, ¥1)) 
1 i an 
rs (to — Zo) = 0. 


Thus the initial conditions for the functions y(t, A, v1, ¥2) are of the form 
V'(to, A, V1, ¥2) = 0. 


Our purpose is to prove that the functions y*(£, A, v1, v2) tend to a defi- 
nite limit as v2 — v,. In order to establish this fact we consider the system 
of equations 

0 Or j zi 
at ya(E, i, Vi, V2) — > Ax(t, A, Vi, va) W(t, i, Vi, V2) + hnaill, r, V1, V2) 


j=1 
(19) 
in the functions 


Wii, ¥,¥1,¥2), t=1,...,2, (20) 
which satisfy the initial conditions 
Vi(lo, r, Vi, V2) = 0. 


By Theorem 13, the functions (20) are defined and continuous for values 
of ¢,4, ¥1, ¥2 which are sufficiently close to the constants fp, Ao = 


1 1-1 Lou 
(Mo, - + +5 Ho )s Moy Mo- 
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The functions y(t, A, 1, Ye) and y(t, A, ¥1, v2) satisfy the same system 
of equations (18), (19) and have the same initial values; this fact, when 
taken together with Theorem 2, shows that they coincide: 


W'(t, 2, V1, ¥2) = W(t, A, 1, V2). 


Here the functions on the right-hand side are also defined at vy) = vg 
and those on the left-hand side are defined only for vy; ¥ va. Passing to 
the limit as ve — v, we see that 


oO - . 
av e'(t, i, v) = v(t, r, y, v), 


so that the derivative (0/dv)¢'(t, A, v) exists, is a continuous function of 
all its arguments, and satisfies the system of equations 


. no j _. 
a (agit, sv) — Di Rr,» ») SEXY 4 Lay). 2D) 
at av ~ jal ov 


From this it is seen that, not only the derivatives 


de'(t, d, v) 
Ov 


themselves exist and are continuous, but that their derivatives with 
respect to f, ie., 

a (ae'lt, 2, ¥) 

2( ov , (22) 


also exist and are continuous. 

Now that it has been established that the derivatives (14) exist and are 
continuous, the proof of the first part of the theorem is complete. The 
second part, however, is more difficult to prove. The system of equations 
(21), satisfied by the derivatives (13), contains the functions hi(t, A, v, v) 
which are determined in a comparatively complicated manner in terms of 
the functions f‘ with the aid of Hadamard’s lemma. 

To derive the variational equations (15) we shall first write out the fact 
that the functions ¢'(t, A, v) satisfy system (1): 


a; ‘i 
ape?) =P (Heb Y),-.-, eA YAY). (23) 
From what has already been proved, the functions ¢*(£, A, v) are differen- 


tiable with respect to v, so that the right-hand sides of (23) can be differen- 
tiated with respect to v; for this reason the left-hand sides of these rela- 
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tions can also be differentiated with respect to v. Thus we obtain [see 
(3) and (11)] 


23 a) 


= = Kt v(t, Xv), d,¥) = ¢ *(t,d,v) + ei(t, elt, A,),%,¥). (24) 


j=1 


From this relation it is evident that the partial derivatives 


2 (3 eit, >, ) (25) 


exist and are continuous. On the other hand, since the partial derivatives 
(22) are also continuous, it follows from a known theorem of analysis 
that the mixed derivatives (22) and (25) coincide. Changing the order of 
differentiation with respect to ¢ and vy in equations (24), we obtain the 
variational equations (15) for k = 1. 

By differentiating the initial conditions (12) with respect to u*, we 
obtain the initial conditions (16) for the functions (13). Thus Theorem 
14 is completely proved. Let us now deduce a simple corollary from 
Theorem 14. 

(B) If all partial derivatives of the right-hand sides of system (1) with 
respect to the variables z',..., 2”, u',..., qm! up to the mth order inclu- 
sive exist and are continuous, then the functions ¢*(t,u), 7 — 1,...,7, 
forming the solution of (1) and satisfying the initial conditions (12), also 
have continuous partial derivatives with respect to the parameters 
ui, ..., mu! up to the mth order inclusive. 

We shall prove proposition (B) by induction on the number m by means 
of the variational equations (15). The case m = 1 has already been 
established in Theorem 14. Let us now assume that it is valid for a given 
number m; we shall prove its validity for derivatives of order m + 1. 
We shall assume that all the partial derivatives of the functions f‘(t, x, #) 
with respect to the variables z!,..., 2", u!,...,u! up to order m+ 1 
inclusive are continuous. By the induction hypothesis, the functions 
¢'(t, 4) have continuous partial derivatives with respect to the variables 
ui,...,u! up to order m inclusive. Thus the functions d¢‘(t, u)/du*, 
which satisfy the system (15), also have continuous partial derivatives up 
to the mth order inclusive by the induction hypothesis, since the right- 
hand sides of (15) have partial derivatives up to order m inclusive, both 
with respect to the functions d¢’(t, w)/du", in which they are linear, and 
with respect to the parameters 1, ..., uw’. Thus proposition (B) is proved. 
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Continuous dependence and differentiability of the solutions with respect 
to the initial values. We shall now investigate the dependence of the solu- 
tion of equation (4) on the initial values fo, Xo; in order to emphasize the 
variability of to, Xo, we shall now denote them by 7 and &, respectively. 
The solution of (4) with the initial values 7, € now depends not only on ¢ 
and » but also on 7 and €, so that it must be written in the form 


x = g(t, m;T, €) (26) 
or, in terms of the coordinates, 
v= g(t, m5 7, €), t= 1,...,n. 


To solve the problem of the range of the variables ¢, u, 7, & where the func- 
tion (26) is known to be defined (whether it is continuous and under 
what conditions it is differentiable), we make a change of variables which 
leads us to the consideration of a solution with constant initial values and 
which transforms these initial values into parameters. Thus by a change 
of variables, the entire complex of problems concerning the dependence 
of a solution on the initial conditions is reduced to a corresponding complex 
of problems on the dependence of the solution on the parameters. This 
change of variables is given in the following proposition. 

(C) Let 7, &, w be an arbitrary point of the domain F. In place of the 
independent variable ¢, which is contained in equation (4), we shall intro- 
duce a new independent variable s by the formula 


t=Tt4+s. (27) 


In place of the unknown vector function x in (4) we shall introduce a new 
unknown function y by the formula 


K=E+Yy. (28) 


In terms of the new variables, equation (4) can be written 
d 
a= fr +s &+y,n)- (29) 


Since the function f(t, x, #) of the variables ¢, x, » is defined in F,, the func- 
tion 
g(s,y,u; 7,£) = f(r +5; + y,z) (30) 


of the variables s, y, w, 7, & is defined under the condition that the point 
(r+s,&+y,m) belongs to T. This condition, as is easy to see, dis- 
tinguishes a certain domain I'’* in the space of variables s, y, uw, 7, &, and in 
this domain the vector function (30) is continuous, and its components 
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have continuous partial derivatives with respect to the variables y!, ... , y”, 
f..., &. Let 

y = ¥(s, m5 T, &) (31) 
be a solution of equation (29) which satisfies the initial conditions 

v(0, M,T, £) = 0. 


If we go back to the old variables by formulas (27) and (28), we obtain the 
solution 


x= ell, Mm; T, £) = £ + vit — T,B;T, £) (32) 
of (4) which obviously satisfies the initial condition 
o(7,m;7,&) = €. 


From the construction given in proposition (C) by applying Theorems 
13 and 14 to equation (29), it is easy to deduce that the solution (32) is 
continuous with respect to all the variables ¢, w, 7, € and differentiable with 
respect to &!,...,&. In the case that the functions f‘(t, x, «), 
a= 1,...,7n, have continuous derivatives with respect to t, u!,...,n’, 
the solution (32) is also differentiable with respect to7, u!,..., w’. Further 
conclusions may be drawn by applying proposition (B) to equation (29). 
Here, however, we shall formulate and prove separately only the most 
essential of the results noted. 


TuHrorEM 15. Let 
# —fit,z,...,2”), @=1,...,%, (33) 


be a normal system of differential equations whose right-hand sides are 
defined and continuous, together with their partial derivatives 


i _ 8 on 
Fit, x) — oxi f , Xx), 
in a certain domain I of the space of the variables t, xz!,..., x”. Further- 
more, let 
x = f(t,x) (34) 


be the vector form of this system. If (to, X9) is an arbitrary point of I, 
then there exist positive numbers r’ and a’ such that the solution 


x = g(t; T, &) (35) 
of equation (34) which satisfies the initial condition 


g(7;7,&) =& 
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is defined and continuous in the variables ¢, 7, £ for 
ft—rl| <r’. lr — tol <a", |E — xo <o’ (36) 


and has continuous partial derivatives with respect to &',..., &",while 
the mixed partial derivatives 


a7 y(t; 7, £) 
at Os 


are continuous and independent of the order of differentiation. In addi- 
tion, the partial derivatives 


ei(t) = age? (t; to, E)le=xo 
satisfy the system of equations 


gilt) = > Filt, ot; to, Xo) ek (t) (37) 


j=1 


under the initial conditions 
vk(to) = dk, (38) 


where 6; is Kronecker’s symbol. The system (37) is called the system 
of variational equations with respect to the initial values. 


Proof. We shall utilize the transformation of variables described in 
proposition (C) in connection with equation (34), i.e., we shall assume that 
the parameter mw is absent in equation (4). By Theorems 13 and 14 there 
exist positive numbers 7’ and p’ such that the solution y = ¥(s, 7, &) is 
defined [see (31)] for 


Is] <7’, |r — tol? + |E — x0]? < (0')? (39) 


and in the same domain has continuous derivatives with respect to the 
variables £!,..., £", since the right-hand side of equation (29) is differ- 
entiable with respect to these variables. In addition, the mixed derivative 
d°y/ds 0 is continuous and does not depend on the order of differentia- 
tion. For a given number p’ it is possible to select a number o’ so small 
that the inequality (39) follows from inequalities (86) for the same values 
of 7 and &. It follows from (32) that when the conditions (36) are satis- 
fied, the solution (35) is defined, continuous, and differentiable with respect 
to £1,..., &", and also has a continuous mixed derivative 0?¢/dt d£ which 
does not depend on the order of differentiation. 
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Further, we write down the fact that the functions g(t; 7, &),..., 
g(t; 7, &) satisfy the system (33): 


Ser =fKGElGno Gro), t= bn (40) 


By what has been proved above, the functions ¢’(t; 7, ) are differentiable 
with respect to £1,..., £", and the mixed derivative 7 y'(t; 7, £)/dt ag" 
is continuous and does not depend on the order of differentiation, so that 
if we differentiate (40) with respect to £* and set rT = to, E = Xo, we ob- 
tain (37). If we differentiate the initial conditions 


(to; to, £) = & 


with respect to £*, we obtain the initial conditions (38). Thus Theorem 15 
is proved. 

We remark that to form the system (87) of variational equations it is 
necessary to know only one solution, p(f) = g(t; to, Xo) of equation (34); 
knowing this solution, however, does not enable us to calculate the deriva- 
tives g(t) = (0/0&*)e'(t; to, £)|e-x,. Thus the calculation of these deriva- 
tives, knowing one solution x = g(t), is reduced to the simpler problem 
of the solution of the linear system of equations (37). 


23. First integrals. We shall introduce here the concept of a first integral 
and solve a boundary-value problem for linear partial differential equa- 
tions. 

First integrals. Let 


= fi(a),...,2%, i=1,...,n, (1) 


be a normal autonomous system of equations whose right-hand sides, to- 
gether with their partial derivatives, are defined and continuous in some 
domain A of the space of the variables x1, ..., x”, and let 


x = f(x) (2) 


be the vector notation for this system. 
(A) A function 
u(z},...,2") = u(x), 


which is defined and continuous, together with its partial derivatives, in 
a certain domain G contained in A, is called a first integral of the system (1) 
if the substitution into (1) of an arbitrary solution x = ¢(é) of (2) leads 
to an expression which is independent of t; that is, the function u(¢(t)) 
depends only on the choice of the solution g(é), and not on ¢. Any first 
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integral u(x) of system (1) then satisfies the condition 


> du(x) rig) = 0, (3) 


= Ox? 
and, conversely, any function u(x) which satisfies condition (3) is a first 
integral of the system (1). 
We shall prove that a first integral u(x) of system (1) satisfies (3). 


Let £ be an arbitrary point of G and let x = g(t, £) be a solution of equa- 
tion (2) with initial conditions 0, £. We have 


0 = Fulvtt, Hino = oH Fe; 


t=1 


since £ is an arbitrary point of G, relation (8) is fulfilled in G. 

Let us now assume that the relation (8) is fulfilled for the function 
u(x), and let x = g(t) be an arbitrary solution of (2). Substituting x = 
g(t) into u(x), we obtain a certain function 


v(t) = u(e(d)). 


By differentiating this function with respect to t, we obtain 


at) _ ule) f(e®) = 0. 


Thus u(g(t)) does not depend on t. 

In what follows, a study of the first integrals of the system (1) will be 
carried out purely locally in a certain neighborhood of the point a of A 
which is not a state of equilibrium of the system (1): 

f(a) ~ 0. (4) 

(B) The first integrals 

uy(K),..., U(X) 


of system (1), defined in a certain neighborhood of a [see (4)], are called 
independent at the point a, or simply. independent, if the functional matrix 


(ue), i=1,...,k, j=il,...,n, 


Oxi 


is of rank k. Thus the number of independent first integrals of system (1) 
cannot exceed n — 1, and if 


U1(K),..., Un—1(X) 


23] FIRST INTEGRALS 183 


are independent first integrals, then any other first integral u(x) can be 
written in the form 


u(x) = U(uy(x),..., u—2(x)), (5) 


where U(u,..., Un—1) is a certain function of n — 1 variables and (5) 
is an identity in x in a certain neighborhood of a. 
Let us prove proposition (B). Let 


u1(X),..-, Un(X) (6) 
be a set of n first integrals of the system (1). By (A) we have 


~ OUs(X) pi a 
Axi S@ =0, j=l,...,” 


=z 
Since the vector f(x) is different from zero at the point a, then it is also 
different from zero in some neighborhood of a. Consequently, in that 
neighborhood the identity 


du;(x)\ 


is valid. Thus the set of n first integrals cannot be independent. Let us 
now assume that the first n — 1 of the first integrals of (6) are inde- 
pendent; then by a known theorem of analysis it follows from (7) that the 
function u,(x) may be expressed in terms of the remaining functions 
uy(X),..., Un—1(xX), that is, identity (5) is valid for u(x) = uaz(x). Thus 
proposition (B) is proved. 

(C) There exist m — 1 independent first integrals of the system (1) 
in some neighborhood of the point a [see (4)]. 

Let us prove this. Since the vector f(a) is different from zero, at least 
one of its components is different from zero. We shall assume that 


f(a) ¥ 0. 


Let —€ = (£',..., £*71, a”) be a point close to the point a, and let x = 
v(t, &) be a solution of (2) with the initial conditions 0, & This solution 
can be written in the coordinate form 


a= g(t, 8,...,8 71), t=1,...,2. (8) 
We shall regard (8) as a system of equations in the unknowns 
By... (9) 


Tor «? = a’, i= 1,...,n, this system of equations has the obvious 
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solution £} = a!,..., &*~1 = a"—!, t = 0, and the functional determi- 
nant of the system (8) is different from zero at this point, since 


ay'(0, a, ... , a”—) 
ogi 


[see (88) of §22], and 
f"(a) = 0. 


Thus there exists a neighborhood G of the point a such that for x belonging 
to G, the system (8) is solvable for the unknowns (9), and the solution 
may be written in the form 


fl = ui(x), ..., E71 = up_i(x), =f = 0(x). (11) 
We shall show that the functions 
uy(X), coy Un—1(X) (12) 


in these relations are first integrals of the system (1) and, moreover, are 
independent at a. Since the functional determinant of system (8) has been 
found [see (10)], it follows that the functional matrix 


(u@), ij=1,...,n—1, 


Oxi 


is a unit matrix, and therefore the functions (12) are independent. We 
shall show that they are first integrals of the system (1). Since the system 
(11) is the inverse of the system (8), the functions (12) satisfy the identities 


u(el,))= 6, t=1,...,2—-1. (13) 


Now let x = ¢(¢) be some solution of equation (2) passing through the 
domain G. Let to, Xo be its initial values, with xo belonging to G. Since 
the system (8) is solvable when x = Xp, there exists a solution x = 
g(t, 9) passing through x9, so that the solution g(t) can be written in the 
form 


v(t) = o(t + ¢, Eo), 


where c is a constant [see (B) of §15]. Thus, substituting x = g(#) into 
the function u,;(x), we dbtain by (13) 


us(e(t)) = ur(elt +c, Eo) = fo, t= 1,...,» —1, 


and proposition (C) is proved. 
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If certain first integrals of system (1) are unknown to us, then the solu- 
tion of system (1) can be facilitated in the same way. This case may be 
formulated precisely in the following proposition. 

(D) Let 

Unt1(X), --- » Un(X) (14) 


be a system of n — k first integrals, independent at the point a [see (B)], 
of the autonomous system (1). By using the functions (14), the order of 
(1) can be decreased by n — k, ie., it can be replaced by an autonomous 
system of order k; in particular, when we deal with a maximal number 
n — 1 of independent first integrals, the autonomous system (1) can be 
reduced to the first order and therefore [see (B) of §2] can be solved by 
quadratures. 

We shall prove proposition (D). Since the first integrals (14) are inde- 
pendent, the functional matrix 


(ule), t=k+1,...,n, j=l,...,n, 


Oxi 


contains a square matrix of order n — k whose determinant is different 
from zero. To be definite, we shall assume that the determinant of the 


matrix 
Ou;(a . 
(aia), 4j=k+1,...,n, 


is different from zero. We can now introduce in the neighborhood of the 
point a the new coordinates 


yrs. ,y” (15) 


in the place of the previous coordinates 


zie... 3” 
by setting 
y=ow,..., y=, 
yt) = ug (X), 2. y™ = Un(k). (16) 
The new coordinates y!, ... , y” are, in fact, introduced by these formulas, 


since the functional determinant of the system (16) is different from zero 
in the neighborhood of a. In the new system of variables (15) the system 
(1) takes the form 


y = g'y',...,y”), t= 1,...,7. (17) 
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But since every function (14) satisfies (3), we have 


g**(y) = 0, sey gy” (y) = 0, 


and therefore the system (17) is actually an autonomous system of order k. 

Linear first-order partial differential equations. The relation (3) can be 
considered as a partial differential equation in the unknown function u(x) 
with variables x', ..., x”. Propositions (C) and (B) have shown that for 
f(a) # 0 there exist n — 1 independent solutions of this equation in the 
neighborhood of the point a, and that, having n — 1 independent solutions, 
we can obtain any other solution of this equation with the aid of formula 
(5). In this connection it is evident that every function given by (5) is 
a solution of (3), and therefore (3) can be regarded as solved. In other 
words, it has been shown that if we know how to solve (1), we also know 
how to solve (3). It is possible, however, to approach the solution of (3) 
from another point of view, that is, it is possible to pose and to solve a 
boundary-value problem for equation (3) and even for an equation of more 
general form than (3). 

(E) Let 


Z 4 Ou 
LI@ sm Faw) (18) 
be a partial differential equation in the unknown function u(x), where 


F(x, u) is a certain prescribed function having continuous first-order 
partial derivatives with respect to all its arguments. Further, let 


x= &(t},..., 17?) (19) 
be the vector form of a given (n — 1)-dimensional surface which passes 
through the point a for #1 = --- = ¢”~! = 0, so that 

&(0,...,0) = a. 


We shall assume that the surface (19) is differentiable and is not tangent 
to the vector f(a) at a so that the vectors 


d£(0,...,0) — 0&(0,..., 0) 

ry , , ry , f(a) (20) 

are linearly independent. Finally, let 
uo(t!,..., 71) (21) 


be a certain function defined on the surface (19). Then, in the neighbor- 
hood of a, there exists (and uniquely, moreover) a solution u(x) of equation 
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(18) which coincides on the surface (19) with the given function (21), so 
that 
u(é(t!, ..., 7) = uolt!, 22, 74). 


To find the solution u(x) we use the trajectories of the system (1), which 
start on the surface (19). These trajectories are called the characteristics 
of equation (18). 

Let us prove proposition (E). We introduce new coordinates into the 
neighborhood of a of the phase space of the system (1) in place of the 


coordinates x!,...,2". Let x = ¢(t,t',...,%"—1) be a solution of equa- 
tion (2) which starts at the point &(t!,..., ¢"—1) of the surface (19), so 
that its initial values are 0, &(t',..., #*~!). We then have the system of 
relations 

v= gi(t,t,...,t 7), i= 1,...,0. (22) 


If we consider as unknowns the variables 
tty... t 7, (23) 
then for x = a this system has the obvious solution 
i=t} a... = 1-9, 


and its functional determinant does not vanish at this point, a fact which 
follows from the linear independence of vectors (20) (see §22, formula 
(38)]. Thus the system (22) allows us to introduce new coordinates (23) 
into some neighborhood of a in place of the coordinates z!,...,2”. In 
these new coordinates the form of (18) is particularly simple, and the 
boundary-value problem stated in proposition (E) can be easily solved. 
Let u(x) be a certain function defined in the neighborhood of the point a. 
Let us substitute into this function the variables (23) in place of the 


variables x',..., «” according to (22); we then obtain the function 
v(t, #,...,07 1) = ule, tt... , 177). 
We have ; 
dv(t, t, — sO) _ » Ou) F(x), 
where x = o(t,é},...,¢"~"). Thus, in terms of the variables (23), 


equation (18) will have the form 


v(t, tt)... , ¢°74) 
at 


= F(g(t,t',..., #77), oft, ,..., 0°79). (24) 
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Since the surface (19) in the coordinates (23) is given by the equation 
t = 0, we must find a solution of equation (24) which reduces to the 
given function uo(t!,...,¢"~1) for £ = 0. To find such a solution, it is 
necessary to solve (24) by regarding it as an ordinary differential equation 
with the independent variable ¢ and the variables ¢t', ..., ”—! as param- 
eters. Moreover, we must find solutions with the initial values 


0, wo(t!,..., 71). 


The function v(t, t!,...,t—1), obtained by virtue of Theorem 14, has 
continuous derivatives with respect to all variables. Thus the boundary- 
value problem posed in (E) is solved. 
Note. Let 
z= fit, 2!,..., 27) (25) 


be a nonautonomous system of differential equations. In order to in- 
troduce the concept of a first integral of this system, we shall transform 
it into an autonomous system by introducing the auxiliary unknown func- 
tion 


Then system (25) augmented by the equation 


grt =— | 


will be autonomous; its first integrals may be taken as first integrals of 
the system (25). 


EXAMPLE 


Let 
H = H(x',...,2";y",...,y") = Hy) (26) 


be a function of two systems of variables. The system of ordinary differen- 
tial equations 
i re) 


—_ ay? Hx, y), 


R 
| 


(27) 


t 


. c] . 

y — an AY), t= 1,...,2, 

is called a Hamiltonian system, and the function H(x, y) is the Hamil- 
tonian function of this system. We see directly that the function (26) 
is a first integral of (27). 


24) TRAJECTORIES ON LARGE TIME INTERVALS 189 


24. Behavior of the trajectories on large time intervals. In proving 
Theorem 2 (see §21) for the given initial values ¢o, X09, a positive number r 
was found such that a solution x = g(t) with these initial values exists 
on the interval |f — f| < r. Actually, the maximal interval of existence 
m, <! < mg [see §3, (A)] of the solution x = g(t) can be larger than 
the interval |f — t9| <r. However, the question of why the maximal 
interval of existence m,; < ¢t < mg can be bounded from one side or the 
other, or even bounded in general, has not been discussed up to this point, 
except in the case of Theorem 3, which dealt with the linear equation, 
where the problem was solved completely. In this section we shall answer 
further the question of why the maximal interval of existence can be 
bounded from one side or the other. 

(A) Let 


od 1 . 
Le = f(t,x,..., 2”), += 1,...,%, 


be a normal system of differential equations whose right-hand sides, to- 
gether with their partial derivatives 


of" 
oxi’ 


are defined and continuous in some domain I of the space FR of the vari- 
ables t, z!,..., 2”, and let 


x = f(t, x) (1) 


be the vector notation of this system. In addition, let # be a certain 
closed bounded set in I, and let 


x = ¢(t) (2) 


be a certain solution of equation (1) with maximal interval of existence 
m, <t < mg [see §3, (A)]. Thus, if the number mg is less than -+o, 
there exists a positive number é€g such that for tf > m2 — €g the point 
(t, p(t)) is outside the set EF. In exactly the same way, if the number m 
is greater than —o, there exists a positive number ¢€; such that for 
£ < m, + €, the point (¢, 9(¢)) is outside the set F. 

To prove proposition (A), we shall use the bound on the number r given 
in proposition (D) of §21. We shall consider only the case mz < +0 
since the case m, > —o is handled in the same way. We shall introduce 
a Euclidean metric into R. Since the set F and the complement of the 
set TI are closed and E is bounded, the distance p between F and the com- 
plement of I’ is positive. This means that if the distance between the point 
(to, Xo) of HE and the point (t, x) of R is less than p, then (¢, x) must belong 
to the domain [. Now let £* be the set of all points of R whose distance 
to the set EF does not exceed the number p/2. Then £* is contained in I, 
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so that the right-hand sides of the system (1) and their derivatives are 
defined on the set H*. Let us choose two positive numbers g and a, such 
that 


p\? 
gta< (3) . (3) 
Let (fo, Xo) be a certain point of the set # and II be the set of all points 
(t, x) which satisfy the inequalities 
Jté— tol Sq, |x — Xo <a. 
It is obvious from (3) that I] is contained in E*. Since E* is closed and 


bounded, it follows that for any point (t, x) of E*, the inequalities 


< K, 4j=Hl,...,2, 


af*(t, x) 
lf, x)| < M, aru) 


are fulfilled, where M and K are certain positive numbers. Thus if the 
number r satisfies (16), (19), and (22) of §21, the solution x = g(t) of 
equation (1) with the initial values (fo, Xo) is defined on the interval 
| — to| <r. The only important fact here is that the number r which is 
found is the same for all points (£9, Xo) of the set #. Hence for the number 
€2 we may now take r. 

Let us assume the opposite, i.e., that for a certain t9 > me — r the 
point (to, (to)) belongs to the set EF. Then we can take the values to and 
Xo = (tp) as initial values of the solution (2). By the bound given in 
proposition (D) of §21, the solution (2) is defined on the interval |t — to| < r, 
which obviously falls outside the limits of the interval m; < t < mg. 
But this contradicts the fact that m; < t < mg is the maximal interval 
of existence of the solution (2). With this contradiction, proposition (A) 
is proved, 

(B) Let 

z = fitt,2!,..., 27), i=1,...,n, (4) 


be a normal system of equations, whose right-hand sides are defined and 
continuous, together with their partial derivatives af*/dx’, in some domain 
I of the space R of the variables ¢, x',..., x”. Here I’ has a special form, 
i.e., it consists of all points of the form (é,2',...,27), where ¢ is an 
arbitrary number and the point (x!,..., 2%) belongs to a certain well- 
defined domain A of the space S of the variables x!,..., 2". In the par- 
ticular case when system (4) is autonomous, we can take A to be an 
arbitrary domain in which the functions f* and df‘/dx’ are defined and 
continuous, and then form I on the basis of this domain A. Let 


x = f(t, x) (5) 
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be the vector representation of the system (4), and let F be a certain 
closed, bounded set in Aand x = g(E) a certain solution of (5) with maximal 
interval of existence my < t < my. Then if the number mz is less than 
+o, there exists a positive number €g such that for t > mg — €» the 
point g(é) lies outside the set F. In exactly the same way, if the number 
m, is greater than — oo, there exists a positive number e€, such that for 
i < m, + €,; the point g(t) lies outside F. 

As in the proof of (A), we shall confine ourselves to the case mz < +2. 
In order to reduce the proof of (B) to that of (A), we select in a suitable 
way a closed, bounded set #. Let m be an arbitrary number which satisfies 
the inequality m < m2; we shall define E as the set of all points (i, x), 
where m < t < me and x belongs to the set F. By (A), there exists a 
positive number €, such that for ¢ > m2 — €2, the point (¢, g()) does 
not belong to #. We can assume here that €2 < mp — m, that is, 
mz — €2 > m. Since the number ¢ satisfies the inequalities mz — €2 < 
ti < me, so that the inequalities m < ¢ < mp are satisfied, the point 
(t, o(¢)) need not belong to E because y(t) does not belong to F. 


EXAMPLE 


To illustrate the results of this section, we shall consider a first-order 
autonomous equation 
1 
t= > 6 
Te)’ ©) 
where f(z) is a polynomial all of whose roots are real and simple; let 
@1, @2,..., @ be their enumeration in increasing order. The phase space 
of equation (6) is a straight line P, all of whose points can serve as the 
domain A with the exception of the points a1, a2, ..., @n, where the right- 
hand side of (6) becomes infinite. If we set 


F(z) -{ F(§) dé, 


then the set of all solutions of (6) is described by the relation 
F(z) = te. 


Since in an autonomous equation a constant time shift does not change 
the trajectories, the set of all trajectories of (6), together with the descrip- 
tion of the motion of x(¢) along them, is given by the relation F(x) = t. 
Let us study the motion of z(t) along the interval ay < x < ag. Since 
f(z) does not change sign on the interval ay < x < a2, we have F(a;) #¥ 
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F (a2). To be definite, we shall assume that 
m= F(a) <M. = F(a). 


It is easy to see that as ¢ traverses the interval m; < t < me, the point 
z(t) traverses the interval ay < x < ad. Hence it is obvious that m, < 
t < mz is the maximal interval of existence of the corresponding solution. 
Here both endpoints of this interval are finite; on the basis of proposition 
(B), this is explained by the fact that in approaching the endpoints of the 
interval m; < t < mg, the point x(t) approaches the boundary of A. 


25. Global theorems of continuity and differentiability. Theorems 13, 
14, and 15, which were proved in §22, establish certain properties of the 
solutions, i.e., continuity and differentiability with respect to the param- 
eters and the initial values. All these theorems, however, were of a local 
character, that is, they referred to sufficiently small intervals of time. In 
this section we shall prove the so-called integral theorems or global theorems 
of continuity and differentiability of a solution with respect to the param- 
eters and the initial conditions. The term “integral,” as applied to 
Theorems 16, 17, and 18 to be proved here, has no relation to the operation 
of integration, but means that continuity and differentiability are estab- 
lished “integrally,” i.e., on some “large” interval of time. 

The proof of the integral theorem of continuity (Theorem 16) differs 
from that of Theorem 13 and will be presented without reference to 
Theorem 13. The integral theorems of differentiability (Theorems 17 
and 18) are proved in exactly the same way as the corresponding local 
Theorems 14 and 15, so that their proofs will not be written out in detail. 

We shall consider the normal system of equations 


t= fid,z',...,2%u',...,0), t= 1,...,n, (1) 

whose right-hand sides depend on the parameters w!,..., 4’. System (1) 
may be written in vector form 

x = f(t, x, u), (2) 


and from now on we shall assume that the right-hand sides of (1) are de- 
fined and continuous, together with their partial derivatives 


af'(t, x, #) 
Oxi 


= filt,x,m), (3) 


in a certain domain F of the space F of the variables 


Let 
x= v(t, #) (4) 
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be the solution of equation (2) with initial values to, Xo. It will be proved 
that if the solution x = g(t, #9) is defined on the interval r; < t < ro, 
then when wp is sufficiently close to yo, the solution (4) is defined on the 
same interval and the function (4) is continuous in (¢, w). In addition, 
under the hypothesis that the partial derivatives df/du* exist and are 
continuous in T, it will be proved that (4) is differentiable with respect to 
the parameters w!,..., 4’ for ry < ¢ < ro and for mw sufficiently close to 
Mo. It is from these theorems and the use of the construction (C) of §22 
that we shall prove the corresponding integral theorem of continuity and 
differentiability with respect to the initial conditions. 

It is now clear that the content of this section is very similar to that of 
§22: the main difference is that we now assume that one solution ¢(é, uo) 
is given on a specific interval r; < ¢ < re, and the existence, continuity, 
and differentiability of the solution g(t, a) (for « close to yg) is established 
over the entire given interval r; < ¢t < re. The situation is the same for 
the dependence on the initial conditions. 

Continuous dependence of the solutions on the parameters. We shall prove 
first the following proposition, which plays an auxiliary role in the proof 
of the integral theorem of continuity. 

(A) Let u(é) be a continuous function of ¢ on the interval tg < ¢ < t1; 
if u(t) satisfies the integral inequality 


t 
ui) < f (ult) + 0)dr, a >0, B>0, (5) 
to 
on this interval, then the estimate, 
B patty __ 
ul) S =e 1) (6) 
is valid. 
For the proof we set 
t 
v(t) = / (au(r) + 8) dr, (7) 
0 


so that 
a(t) = au(t) + 8B. 


T'rom the last equality we have 
1, 
u(t) = = (o() — 8). 


Then, from the original inequality we have 


* @@ — 8) < o, 
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or, what is the same thing, 


b(t) — av(t) < B. 


ped 


If we multiply this last inequality by e—*', we obtain 


s()e-** — v(tae~t = £ (u(e-*) < pe-*". 


Integrating the last inequality from t) to ¢ and using the fact that 
v(to) = 0, which follows from (7), we find that 


vent < Fee — oh, 


or 


of) < E et — 1), (8) 
Trom (7), (5), and (8), we obtain the required inequality (6). 
THEOREM 16. Let (fo, Xo, #o) be a certain point of the domain I and let 
x = o(t, mu) (9) 
be a solution of equation (2) which satisfies the initial condition 


v(to, BH) = Xo. 


If the solution 
x= v(t, Mo) 


is defined on the interval r; < ¢ < ro, then there exists a positive 
number p such that for |~ — sol < p the solution (9) is defined on the 
same interval r; < é < rg and the function g(t, a) is continuous in 
the variables ¢, w for |“ — pol < p,71 < t < ro. 


Proof. As the number ¢ traverses the interval r; < t < re, the point 
(t, p(t, #0), #0) describes some curve Q in the space R. We shall construct 
a certain closed neighborhood fi of the curve Q. Let a and b be two positive 
numbers. We shall denote by fi the set of all points (t, x, #) of R which 
satisfy the conditions r; <t < re, [x — e(t,mo)| < a, |w — mol <b. 
From the fact that the solution g(t, #9) is continuous and passes through 
T it follows that there exist positive numbers a and b such that the set II 
is contained in T. Henceforth we shall assume that a and b satisfy this 
condition. Since the derivatives (3) are continuous on the set ff and are 
therefore bounded in modulus by some number K on Il, it follows from the 
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inequality (6) of §21 that for any two points (t, x1, ), (t, Xo, «) of IL we 
have the relation 


f(t, X2,#) — f(t, x1, m)| < n?K|xg — xj]. (10) 


Further, from the uniform continuity of the function f(t, x, #4) on the set 
fl it follows that there exists a positive monotonic function 8(€) of € which 
tends to zero with € > 0, such that for any two points (¢, X, uo), (t, X, #) 
of II the relation 

f(t, x, m) — f(t, x, Mo)| < B(|m — mol) (11) 
is satisfied. 

Now let x = g(t, #), |# — Mol < 0, be the solution of (2) with initial 
values (to, Xo). We shall assume that this solution is defined on the in- 
terval t; <¢ < te, where ry < t; < to < tg < re, and is completely 
contained in the set Il, that is, for t; < ¢ < t, the point (¢, (t, mu), m) 
lies in II. We shall estimate the difference |o(t, «) — y(t, #o)| on the 
interval. We shall carry out calculations only on the interval tg < t < te, 
since the calculations on the interval 4; < ¢ < ty are similar. 

Let us write equation (2) in the form of an integral {see §21, (A)] for 
both values of the parameters # and mo, and subtract the second integral 
from the first; we then obtain 


t 
v(t, #) —_ v(t, Mo) = / [f(r, e(T, #), h) —_ f(r, e(7, Ho); Ho)] dr, 
0 
to <t< te. 
Let us estimate the difference on the right under the integral sign. We have 
lf(r, (7, m),) — f(r, o(7, m0), mo)! < [f(r, o(7, m), wm) — f(r, (7, mo), #)| 
+ I(r, @(7, mo), #) — £(7, (7, Ho), mo): 


The first term on the right-hand side can be estimated by means of the 
inequality (10) and the second by the inequality (11). Combining these 
two estimates, we obtain 


l(t, ) — v(t, mo)| < / [n?K - |o(r, w) — o(t, mo)| + B(\e — mol)] ar. 


If we set u(t) = |e(t, #) — o(t, wo)|, it follows from proposition (A) that, 
fortp St < le Sre, 


B(\e — Mol) (er? toy 1) 
n 


lo(t, #) — e(t,mo)| < OK 


< Ble pal) (e?’ Ker2—r1) — 1) = CB(\e — po). 
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By making a similar estimate on the interval {; < ¢ < to, we obtain 
lo(t, m) — v(t, Ho)| S CB(\m — mol). (12) 


Thus the estimate (12) is valid over the entire interval {; < t < ty. 
Let p be a positive number which satisfies the inequalities 


pb, CB(p) <a, (13) 


and m a fixed value of mu, for which 


[41 — Mol Sp. 


We shall show that the solution g(t, #;) is defined on the entire interval 
Tr < t < To. 

Let m, < ¢ < mg, be the maximal interval of existence of the solution 
o(t, #1) [see §3, (A)]. We shall show that mz > rg (the inequality m,; < rj 
is proved analogously). For this we shall denote by & the set of all points 
(t,x, «) of Tl for which « = m,. Let us assume that my < re. It then 
follows from (A) of §24 that the point (t, 9(¢, #1), #1) must leave the closed 
set HE as t— mz. This is possible only by violating the inequality 
lo(t, #1) — (t, wo)| < a, since the inequality ¢ < r2 holds by hypothesis. 
Since the inequality |g(t, #1) — o(t, #o}| < @ is violated for some value 
of t, there exists a value of t, which we shall denote by ¢2, such that 


lo(t2, #1) — e(te, mo)| = a, 
lo(t, #1) — lt, mMo)| <a for to <t < le. 


This is impossible, however, because of the estimate (12) [see (13)]. 

Thus the solution g(t, «) is defined on the entire interval r; < ¢ < re 
for |“ — mol < p. Thus the first assertion in Theorem 16 is proved; we 
shall prove the second assertion, i.e., the continuity of the function 
v(t, m) forr, < t < re, |w — pol < p. We shall prove the continuity of 
e(t, w) at the point (5, 5), where r: < t§ < re, up — Mol < p. Since 
the solution g(t, 45) satisfies the hypotheses of Theorem 16, it follows from 
what we have already proved that there exist for it a positive number p*, 
analogous to p, and a function C*@*(e), analogous to C@(e). Now let » 
be a value of the parameter satisfying the condition |u — uo| < p* and 
ta number satisfying the condition r; <§ < ro. Let us estimate the 
difference y(t, #) — ¢(t5, m9). We have 


lo(t, m) — v(t, #0)| < le(t,#) — elt, #0)| + lel, #0) — (és, #0)I- 


The second term on the right-hand side tends to zero as ¢ —» tg because 
of the continuity of g(t, 4g) in t, while the first tends to zero as w— "5 
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(uniformly with respect to f) because of the existence of C*8*(e). This 
proves Theorem 16 completely. 
Differentiability of the solution with respect to the parameters. 


THEOREM 17. Let us assume that the partial derivatives 
j =~ ¥ p= k= i 
eh Hw) = FFG H), b= 41... yt, =1,..., ) 


of the right-hand sides of the system (1) exist and are continuous in the 
domain T. Let (fo, Xo, #0) be a certain point of T and 


x = g(t, u) (14) 
a solution of equation (2) which satisfies the initial condition 


e(to, H) = Xo. (15) 


If the solution 
x = oft, Mo) 


is defined on the interval r; < ¢ < re, then there exists a positive 
number p’ such that for |# — pol < p’ the solution (14) is defined on 
the interval 7; < ¢ < rz and has continuous partial derivatives 


dell, m) , 
the mixed partial derivatives 


a’ g(t, m) 
Ot dpe 


are also continuous and do not depend on the order of differentiation. 
In addition, the partial derivatives (16) satisfy on the interval r; < 
t < ro the system of variational equations 


d [agit ni agi(t 

at (ea) = ¥ filt, ot, m), #) eter + ex(t, o(t, m), m) 
j=l 

under the initial condition 


d¢*(to, m) =— 0 
Our . 


Proof. The proof of Theorem 17 repeats almost verbatim the proof of 
Theorem 14 with the following modifications. The reference to Theorem 13 
is replaced by a reference to Theorem 16. In addition, the solution o(t, ms) 
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is considered on the entire interval r; < t < re, and not on the interval 
|t — to| <r; the same is true of the functions (20) of §22. Analogous 
modifications must be made in order to prove proposition (B) below and 
Theorem 18. 

We shall now make one simple deduction from Theorem 17. 

(B) If all partial derivatives of the right-hand sides of system (1) with 
respect to the variables x!,..., 2”, ul,...,m! up to order m inclusive 
exist and are continuous, then the functions ¢‘(t, ), 7 = 1,...,, which 
comprise a solution of (1) and satisfy the initial conditions (15), also 
have, for | — mol < p’, r: < & < re (where p’ is a sufficiently small 
positive number), continuous partial derivatives up to order m inclusive 
with respect to the parameters y/,..., wu. 

[See the proof of proposition (B) of §22.] 

Continuous dependence and differentiability of the solutions in terms of 
the initial values. By means of the construction given in (B) of §22, we can 
obtain the following result from Theorems 16 and 17. 


THEOREM 18. Let 
#@=fitc',...,2), i= 1,...,n, 


be a normal system of differential equations whose right-hand sides are 
defined and continuous, together with their partial derivatives 


filt, x) = x rt x), 


in some domain I of the space of the variables t, x',... , «”. In addition, 
let 
x = f(t, x) (17) 


be the vector representation of this system. If (tp, X9) is an arbitrary 
point of T and if the solution 


X= g(t) 


of equation (17) with the initial values tg, Xo is defined on the interval 
ry, < & < ro, then there exists a positive number p’ such that for 


mr St< 1, Ir — tol <0’, |E — Xo] <0’, 


the solution 
x = g(t, T, £) 


of equation (17) with initial values 7, £ is defined and continuous in the 
variables ¢, 7, £ and has continuous partial derivatives with respect to 
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the components ¢!,..., &* of the vector £, while the mixed partial 
derivatives 
d°¢'(t, 7, &) 
ot ogs 


are continuous and do not depend on the order of differentiation. In 
addition, the functions 


i QO 
eH) = ax OC fo, Ble, 


satisfy the linear system of equations (the variational equations) 
gi = DO Fit oO) ek) 
j=1 
under the initial conditions 
vk(to) = ob. 
(See the proof of Theorem 15.) 


(C) When the conditions of Theorem 18 are satisfied for any positive 
number €, a positive number 6 can be found such that for |r — fo| < 6, 
|E — Xo| < 6 the inequality 


l(t, T, §) —_ v(t, to, Xo)| <€ (18) 


is valid for any ¢ in the interval r; < ¢ < ro. 

To prove this inequality, we note that for 0 < 0” < o’ the function 
g(t, 7, &), which is continuous by Theorem 18, is uniformly continuous 
on the closed set defined by the inequalities 

lr — tol So”, |E — Xo| < 0”, ry St < re. 


Thus for any positive €, a positive 6 can be found such that for |r — to| < 4, 
| — Xo| < 4, lt) — te| < 6, the inequality 


le(te, T, £) ~— ets, to, Xo)| <€ 


is valid. Inequality (18) follows from this inequality for tj = t2 = t. 


CHAPTER 5 
STABILITY 


The performance of numerous mechanical, electrical, and other types 
of devices (machines, instruments, etc.) is described by systems of ordinary 
differential equations. A system of ordinary differential equations always 
has an infinite number of solutions, and in order to find a certain definite 
solution it is necessary to specify its initial values. However, devices 
which are used in practice usually operate under completely well-defined 
conditions, and in their performance it is impossible, at first glance at 
any rate, to discover an infinite number of operational conditions which 
correspond to the various solutions of the system of equations. This can 
be explained either by the fact that the initial values of the solution are 
chosen at the start of the operation in a certain well-defined way, or by 
the fact that the initial values lose their effect during continued operation 
of the device, and the device itself stabilizes its operation at a stationary 
solution. We have already encountered the latter phenomenon when we 
analyzed the performance of electrical circuits. We shall give one more 
example. A clock runs with a completely prescribed amplitude of its 
pendulum, although when the clock is started, the pendulum can deviate 
either slightly or greatly from the vertical position. If in starting the clock 
the pendulum is not deflected sufficiently, then it will stop after a few oscil- 
lations. If the deflection is great enough, then after a short time the 
oscillation amplitude of the pendulum becomes quite well-defined, and the 
clock will run with this amplitude for an indefinite period of time, if not 
forever. Thus the system of equations which describes the performance 
of the clock has two stationary solutions: a state of equilibrium which 
corresponds to the rest position, and a periodic solution which corresponds 
to what we think of as the normal performance of the clock. Every other 
solution—and there are undoubtedly an infinite number of these solutions— 
approaches one of these two stationary solutions very rapidly, and after 
a short time becomes practically indistinguishable from it. Each of the 
two stationary solutions noted appears to be in a certain sense stable. 
This means that if we take a solution which is not stationary, but which 
deviates very little from a stationary solution at time zero, then the non- 
stationary solution approaches the stationary solution. This is a definition 
of the stability of a solution, though not formulated in precise terms. From 
this example it is seen that the phase space of the system of equations which 
describes the performance of the clock decomposes into two attractive 
domains. If the initial value is taken in one of the domains, then the 
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solution will tend toward the state of equilibrium; if the initial value is 
taken in the other domain, then the solution will tend toward the periodic 
solution. 

From what we have seen, it is clear that in order to understand com- 
pletely the performance of any device we must analyze the phase space 
of the system of equations which describes the performance of the device. 
Here it is most important to know all stable solutions of this system of 
equations. 

Trom the integral theorem on the continuous dependence of a solution 
on its initial values (see §25), we already know that if a definite finite 
interval of time is set, then for a sufficiently small deviation of the initial 
values, the solution will undergo only a small deviation on the entire 
time interval; this property of the solution, however, does not necessarily 
mean stability. Where stability is concerned, the deviation on an infinitely 
large time interval must be small only if the deviation from the initial 
values is small. 

The present chapter is devoted basically to the problem of stability of 
states of equilibrium and of periodic solutions. 

We also include two important applications to engineering problems: 
Vyshnegradskiy’s work on the performance of a steam engine with Watt’s 
governor and Andronov’s work on the operation of a vacuum-tube oscil- 
lator generating nondamping electrical oscillations. The first of these 
studies has been the basic principle in the theory of automatic control, 
and the second, in the theory of nonlinear oscillations. 

In §30 we shall study the behavior of trajectories close to the equilibrium 
states of a second-order autonomous system, a problem which does not 
pertain entirely to the stability problem. This section is somewhat more 
difficult than the average level of the book. Still more difficult in content 
is the last section of this chapter (§31). 


26. Lyapunov’s theorem. We shall describe here the concept of 
stability and sufficient stability conditions in connection with the equilib- 
rium state of an autonomous system (see §15). 

The stability of an equilibrium state. Let 


# = fi(e!,...,2%, ix=l,...,n, (1) 

be a normal autonomous system and let 
x = f(x) (2) 
describe the system in vector notation. We shall assume that the functions 


fi(v',...,2%, t=1,...,n, (3) 
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are defined and have continuous first-order partial derivatives in a certain 
domain A of the space of variables z!,...,2". Later, in establishing 
stability criteria, we shall strengthen the differentiability requirements 
by assuming that the functions (3) have continuous second-order partial 
derivatives in A. 

Without giving the formal definition of Lyapunov stability, we shall 
first try to convey the idea of stability. The equilibrium state a = 
(a!,..., a") of equation (2) should be considered stable if any solution of 
equation (2) starting at ¢ = 0 from a point sufficiently close to a remains 
in the neighborhood of a during subsequent variation (i.e., for £ > 0). 
The physical sense of stability is clear. A physical object (for example, 
some machine) whose motion is controlled by equation (2) can be in the 
equilibrium state a only if this state of equilibrium is stable, since a trivial 
deviation from the state of equilibrium caused by a random impulse can 
force the object far out of the state of equilibrium. 

Below, we shall denote by ¢(t, &) the solution of equation (2) with initial 
values ¢ = 0, x = &, so that ¢(t, &) is a vector function of the scalar vari- 
able ¢ and the vector variable £, which satisfies the condition 


(0, &) = &. (4) 


Definition. The equilibrium state a of equation (2) is called Lyapunov 
stable if (1) there exists a positive number p so small that for |g — a] < p 
the solution g(t, £) of (2) is defined for all positive values of ¢; (2) for any 
positive number e there exists a positive number 6 < p such that for 
|e — a| < 6 we have |g(t, £) — a| < e€ for all £ > 0. An equilibrium 
state a of equation (2) which is Lyapunov stable is called asymptotically 
stable if (3) there exists a small positive number 0 < p such that for 
|— — al < o we have 

jm, lott, £) _ a = 0. 


We shall first give sufficient conditions for the stability of the equilibrium 
state for a linear homogeneous system with constant coefficients. 

(A) Let 

xk = AX (5) 

be a linear homogeneous equation with constant coefficients written in 
vector form. Its solution with initial values 0, & will be denoted by 
w(t, €). If all eigenvalues of the matrix A have negative real parts, then 
there exist positive numbers a and r such that the inequality 


v(t, &)| < rléle“", 94 2 0, (6) 


holds. It follows directly from (6) that the equilibrium state x = 0 of 
equation (5) is Lyapunov stable and asymptotically stable. 
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Let us prove inequality (6). Let us set 


A= (ai), L(p) = (a} — pdi). 


Then, by using the differentiation symbol p (see §7), equation (5) can be 
written in a scalar form as the system 


>, Lip)? = 0, i=1,...,2. (7) 
j=1 
Let Mi(p) be the minor of the element Li(p) of the matrix L(p) taken 
with the appropriate sign, so that the identity 
Do Mi@)Li@) = 8 D(p) 
i=l 


= 


holds, where D(p) is the determinant of L(p). By multiplying (7) by the 
polynomial M?(p) and summing with respect to 7, we obtain 


0= > >> Mip)Li(p)x’ = DD 6fD(p)2’? = D(p)s*. 
i=1 j=1 j=1 
Thus, if 
x = (z!,...,2”) 


is some solution of equation (5), then every function 2’ satisfies the differ- 


ential equation 
D(p)x* = 0. 


Since, by hypothesis, all roots of the polynomial D(p) have negative real 
parts [see §9, (A)], for the function z* the inequality 


|x| < Re, i=1,...,n, t> 0, 


holds, where R and a are positive numbers which do not depend on the 
number 7. From this inequality follows the inequality 


Ix] < Vn Re’, t>0, 


which has already been proved [see §11, (B)] under more general assump- 
tions; here this proof will be carried out again. 
Let e; be the unit coordinate vector corresponding to 7, so that 


e; = (0,...,1,...,0), 


where the ith place contains the number 1. Further, let y,(t) be the 
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solution of (5) with initial value e;, so that 
¥i(0) = e@, t=1,...,n. 
Then the solution y(t, &) of (5) with the initial value 
£ = (é',..., &) 


can obviously be written in the form 


vit, &) = D> E'y.(0). (8) 
1=1 
Since the inequality 
lwi(t)| < Vn Re“, t>0, 


holds for every solution y;(¢), it follows that inequality (6) holds for the 
solution y(t, &). 

Lyapunov stability of the equilibrium state x = 0 follows directly from 
(6). Actually, if € is a given positive number, then it is sufficient to take 
for 6 the number e/r. Asymptotic stability also follows from (6). 

The Lyapunov function. In establishing a criterion for the equilibrium 
state of the nonlinear system (1), the so-called differentiation with respect 
to a system of equations is used; this type of differentiation finds other 
applications in addition to the proof of Lyapunov’s theorem. 

(B) Let 

F(a!,...,2") = F(x) 


be some differentiable function of variables z!,... , x” defined in a domain 
A. Its t-derivative with respect to the system of equations (1) is de- 
fined at the point x = (x’,...,2") in the following manner. Let ¢(t) 
be a solution of equation (2) which at some value ¢ = fp satisfies the 
initial condition 
e(to) = x. 
The derivative ; 
F(1)(x) 


with respect to system (1) is defined by the formula 
. d 
Pay(x) = di F(e(0))|tato» 


or by means of the total derivative formula 


Paya = 7 FO pie, (9) 


i= 


26] LYAPUNOV’S THEOREM 205 


From formula (9) it is evident that the derivative F,,,(x) does not depend 
on the solution g(t) but is determined uniquely by the choice of the point x. 

We shall now prove a property of an autonomous system. 

(C) The solution of the autonomous system (2) with initial values 0, & 
will be denoted as before by g(t, £&). The function g(t, £) satisfies the 
identity 

o(t, o(s, £)) = o(s + 4, &). (10) 


We shall prove formula (10). Let us set 
n = 9(s, &), (11) 
where s is a fixed number, and consider the solution 
ei(t) = eft, n) 


of equation (2). Since g(t, £) is a solution of (2) and since this equation is 
autonomous [see §15, (A)], the function g2(t), which is determined by 
the relation 


vot) = o(t + s, &), 


is also a solution. We thus have two solutions, »;(¢) and goa(t), of equation 
(2). Furthermore, 


¢1(0) = 90,9) =n 
[see (4)], and 
92(0) = o(s, £) = n 


[see (11)]. Therefore, since the solutions ¢,(¢) and go(t) have common 
initial values, they coincide and relation (10) holds. 

In the proof of Lyapunov’s theorem the basic role is played by a certain 
positive definite quadratic form called the Lyapunov function. First we 
shall note certain properties of positive definite quadratic forms [see (D)], 
and then we shall construct the Lyapunov function itself [see (E)]. 

(D) Let 

x= (v',..., 27) (12) 


be a variable vector in an n-dimensional space. By a quadratic form of 
the vector x we shall mean a function W(x) determined by the formula 


W(x) = > wijt's’, 


t,j=l 


where w,; = wj; are real numbers. The quadratic form W(x) is called 
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positive definite if for x ~ 0 we have 
W(x) > 0. 


We find that for any positive definite quadratic form W(x), it is always 
possible to choose two positive numbers u and vy such that the inequality 


ulx|? < W(x) < vx)? (13) 


holds for an arbitrary vector x. From this it follows that for an arbitrary 
x [see (12)] the inequality 


Iz'| < V: W(x) (14) 
holds. 
We shall prove the existence of the numbers uw and v. For this we shall 
consider the values of the function W(£) when the vector £ belongs to 
the unit sphere, i.e., when & satisfies the condition 


[é| = 1. (15) 


Since the sphere (15) is a closed bounded set and the function W(E£) is 
continuous, it attains its minimum p and its maximum » on the sphere (15). 
Since all vectors of the sphere (15) are nonzero, the numbers w and v are 
positive. Let x be an arbitrary vector; then we have x = X&€, where the 
vector £ belongs to the sphere (15) and therefore the inequalities 


h< WE) <v 


hold for the vector £. By multiplying this last inequality by \7, we obtain 
the inequality (13). 
We shall now proceed to the construction of the Lyapunov function. 
(E) Let 


ne . 
t= > ajz’, t=1,...,27, (16) 
j=1 
be a linear homogeneous system of equations with constant coefficients, 
where all eigenvalues of the matrix A = (a;) have negative real parts. 


Then there exists a positive definite quadratic form W(x) whose derivative 
with respect to the system (16) [see (B)]} satisfies the inequality 


Wae(x) < —BW(x), (17) 


where x is an arbitrary vector and 8 is a positive number independent of x. 
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Let us construct the form W(x). We shall assume that the system (16) 
is the scalar form of the vector equation (5). The solution of (5) with initial 
values 0, £ will be denoted by y(t, £) as in proposition (A); we then have 


vit, &) = do EyAld) (18) 
i=l 
[see (8)]. We now form . 
w(t) = i I(r, &)|? ar. (19) 


It follows from (18) that 


WwW) = > ee i (Wilt), ¥i(r)) ar. (20) 


i j=l 


Since every function y(t) satisfies (6), each improper integral on the right- 
hand side of (20) converges, so that, W(x) is a quadratic form for the vector 
£. This quadratic form is positive definite, since for £ ~ 0 the integrand 
of (19) is positive, so that W(é) > 0. Let us now calculate the derivative 
Wis) (£) of the function W(£) with respect to the system (16). To do this, 
according to the hypotheses of (B), we draw the solution y(t, £) through 
the point £ and then calculate the derivative of the function W[y(t, &)] 
at £ = 0. We remark first that, as a consequence of (C), 


¥(7, v(t, &)) = vir +2, 8), 
so that 


Wit, &)) = / lyr, w(t, £))|? dr 


0 


=| [W(t + 7, &)|? dr -{ lv(r, &)|? dr. 


Thus we have 


Was =F we, | = a f Iv(r, &)|” dr 


t= 


= —|¥(t, &)7|-0 = —IEl’. 
We have thus obtained the equality 


Wae(é) = —|el’, 


but, by the second of the inequalities (13), we have 


|? < —i we, 
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and therefore we obtain 
. 1 
Wael) < — 3 W(é). 


Thus inequality (17) is proved. 
Lyapunov’s theorem. Let us proceed, finally, to the formulation and 
proof of Lyapunov’s theorem. 
Let 
a = (a',...,a”) 


be an equilibrium state of the autonomous system (1). We shall set 
vi=-aitaAz', i=1,2,...,n, (21) 
and take as new unknown functions the quantities 
Az!,..., Ax”. (22) 


Substituting (21) into the system (1) and expanding the right-hand sides 
into Taylor series in the variables (22), we obtain 


aa’ = s'(@) + EO) ae + Re i=1,...,, (28) 
j=1 


where R* is an infinitesimal of the second order: with respect to the un- 
knowns (22). Since a is an equilibrium state of the system (1), 


f(a) = 0; 
in addition, if we set 
i _ af'(a) 
j= (24) 
we can write (23) in the form 
At’ = Do ajax’ +R, t=1,...,n. (25) 


j=1 


THEOREM 19, If all eigenvalues of the matrix A = (ai) [see (24)] have 
negative real parts, then the equilibrium state a of the system (1) is 
asymptotically stable; more precisely, there exists a positive number o 
so small that for |— — a] < oa, the inequality 


lett, €£) — al < rl— — ale, (26) 


holds, where 7 and a@ are positive numbers which do not depend on £. 
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Proof. We shall assume that the equilibrium state a of (1) coincides with 
the origin, i.e., that a = 0. This can always be attained by parallel trans- 
lation of axes; the matrix A is invariant under this transformation. As- 
suming that a = 0, we have 


ai? +R, i=1,...,2, (27) 


\| 
~~, 
i Ms 
U 


where 


nr - 
t= Via’, i=1,...,2, (28) 


which can be obtained from (27) by linearization, that is, by discarding 
the remainder terms R’*. If we calculate the derivative W(27)(x) of the func- 
tion W(x) with respect to the system (27), we have 


Weon(2) > ow) ai =] 4 »» oe t 


dW (x) p 
xt 


W 28)(x) + > 


Since the function W(x) satisfies (17), we have 


aW(x) i 
Ox? BR. 


Wane) < 6) + Y 


Let us now choose a positive number 6 so small that for 
W(x) <b (29) 


the vector x will belong to the domain A [such a number exists by virtue 
of (13)]. Since the second derivatives 3°f‘(6x)/dx’ ax*, being continuous 
functions, are bounded in the ellipsoid (29), they are also bounded in the 
ellipsoid 


[R'| < kjx|? < = WO), 
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where k is a certain constant. Furthermore, since 9W(x)/dz’ is a linear 
forminz!,..., 2”, then 


< WWs), 


Buia 
Oxt 


where / is a certain constant [see (14)]. Thus there exists a positive number 
q such that for W(x) < b we have 


If we now choose a positive number c in such a way that 


c<b, aves, 


we have 


Went) < — § W@) 


whenever the inequality 


W(x) <c (30) 
is satisfied. If we set a = 8/4, we obtain the inequality 
Wi27)(%) < —2aW(x), 


which is valid if inequality (30) holds for x. 
Let £ be an interior point of the ellipsoid (30), i.e., a point satisfying the 
inequality 
Wf) <. (31) 


We shall denote the solution of (27) with the initial values 0, £ as before 
by o(t, £), and we set 
w(t) = Welt, £)). 


The function w(t) is defined for all those values of t > 0 for which the 
solution g(t, £) is defined, and by (B) it satisfies the condition 


wt) < —2aw(t) (82) 


whenever 
w(t) <e (33) 


holds. If the solution g(t, &) does not exist for all positive values t, then 
the point x = g(t, £) must necessarily leave the ellipsoid (30) with in- 
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creasing ¢ [see §24, (B)]. Let us assume that the point x = g(t, £) leaves 
this ellipsoid, and let ¢? > 0 be the first value of t for which the point crosses 
the boundary. Then on the interval 0 < ¢ < ?’ the point g(t, £) belongs 
to the ellipsoid (30) so that (82) holds and w(¢) is not positive. Con- 
sequently we have a contradiction: ec = w(t’) < w(0) <e. 

Thus the solution g(t, £), as well as the function w(2), is defined for all 
positive values of t, and inequality (32) holds for all these values. If 
£ ~ 0, then w(t) > 0, and we can carry out the following calculations, 
beginning with inequality (32): 


t 
oO) <9. [x _ . 
wld) < —2a; » w(t) dt < —2ed, for t> 0; 


In w(é) — In w(0) < —2eadt. 
The last inequality gives 

W(t, £)) < Wee". 
Combining this inequality with (13) we obtain 


lv(t, |? < 7 le7e-?*", tt > 0, (34) 


which is valid whenever (31) holds for &. 
Inequality (31) follows from the second of the inequalities (13), together 
with the relation 


lsh <<o= Ne (38) 


Thus if (35) is valid, then inequality (34) is also valid, and if we take the 
square root of (34), we obtain the inequality 


lo Ol sf lee, 1 > 0, 


which coincides with (26), where r = \/v/u anda = 0. Thus Theorem 19 
is proved. 

The following proposition (I°) describes a case which is in a certain sense 
opposite to that considered in Theorem 19. 

(I) The equilibrium state a of equation (2) will be called completely 
unstable if there exists a positive number o such that any solution o(é, &) 
of equation (2) starting at the point £ ~ a of the sphere [—£ — al <o 
leaves this sphere and does not return to it. That is, there exists a positive 
number 7’ = T(£) such that for ¢ = T' the solution ¢(E, &) is defined, and, 
for all values t > T for which this solution is defined, it satisfies the in- 
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equality [p(¢,£) — al > o. It turns out that if all eigenvalues of the 
matrix (df‘(a)/dx’) have positive real parts, then the equilibrium state a 
of equation (2) is completely unstable. 

To prove proposition (I*) we shall use certain results established in the 
proof of Theorem 19; here, as before, we shall assume that a = 0. In 
order to prove this, let us consider along with equation (2), for which 
all eigenvalues of the matrix (df*(a)/dxz’) have positive real parts by hy- 
pothesis, the equation 

x= —f(x), (36) 


for which the point 0 obviously satisfies the conditions of Theorem 19. 
By virtue of the construction given in the proof of Theorem 19, a Lyapunov 
function W(x) exists for equation (36) and satisfies the inequality 


W36(x) < —2aW(x) 


under condition (30). Writing out explicitly the left-hand side of this 
inequality [see (9)], we obtain 


Wen) = dy YO (yw) < -2aWeH) 
t=1 


or . 
WwW, 1)(x) > 2aW (x) . 


This inequality is automatically valid whenever (30) holds. Now let & 
be some interior point of the ellipsoid (30) [see (31)]. Let us set 


w(t) = Welt, &)). 


The inequality 
w(t) > 2aw(t) (37) 


holds for the function w(t) whenever the inequality 
w(t) <e 
is valid. Since & # 0, then w(t) > O and the following calculations can 


be carried out, starting from inequality (37): 


t 
w(t) J w(t) 
wit) > 2a; » w(t) dt > 2at, for t> 0; 


w(t) > wOe?*; Wlolt, £)) > We?" 


T'rom the last inequality it follows that, as ¢ increases, the point x = 
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g(t, £) goes out to the boundary of the ellipsoid (30) and hence leaves the 
interior. We shall show that it then cannot return to the interior of the 
ellipsoid (30). If we assume the opposite, then we could find a positive 
value ¢’ such that w(t’) = c and, for all sufficiently small positive values 
of At, the inequality w(t’ + At) < ¢ would hold. From the last two 
relations it would follow that w(t’) < 0, thus contradicting (37) which 
holds for ¢ = #’, since w(t’) = c. Thus we have proved that the trajectory 
x = o(t, £), where £ ¥ 0 is an interior point of the ellipsoid (30), must 
leave the ellipsoid (80) and cannot return to it. Irom the second of the 
inequalities (13) and from the inequalities (35) we obtain (31), so that 
the sphere (35) is contained in the ellipsoid (30). What we have proved, 
in view of this fact, implies the validity of the assertion (I). 


EXAMPLE 


As a supplement to (A) we shall show that if the matrix A has an eigen- 
value \ with a positive real part, then the equilibrium state x = 0 of 
equation (5) is no longer Lyapunov stable. Actually, by (A) of §14, the 
solution of (5) is a vector function x = che*’, where c is an arbitrary real 
constant and h is an eigenvector of the matrix A with the eigenvalue X. 
If \ is a real number, then for sufficiently small ¢ the solution starts at the 
point ch, which is arbitrarily close to the equilibrium state x = 0, but 
whose modulus becomes arbitrarily large with time. If \ is a complex 
number, then the solution c(he™ + he™) of equation (5) has the same 


property. 


27. The centrifugal governor and the analysis of Vyshnegradskiy. In 
modern technology, thanks to the abundance of automatic control devices, 
a large role is played by the theory of automatic control. One of the most 
important problems arising for the designer of an automatic control de- 
vice is that of the operational stability of a machine-control system. This 
problem can be solved in many cases on the basis of Lyapunov’s theorem 
(see §26). 

The oldest existing automatic control system is the system of the steam 
engine and Watt’s centrifugal governor. The centrifugal governor, which 
at the end of the 18th and the first half of the 19th centuries handled its 
task quite adequately, began to operate unreliably because of design 
changes in the middle of the 19th century. A number of theoreticians and 
engineers sought a way out of the crisis that had arisen. The problem was 
solved with complete clarity and simplicity by the outstanding Russian 
engineer Vyshnegradskiy, one of the originators of the theory of auto- 
matic control. Vyshnegradskiy’s work, Direct-action governors (1876), was 
one of the first studies in the theory of machine control which sought to 
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Centrifugal 
governor 


To steam 
engine 
cylinder 


Figure 46 


answer questions raised by industrial practice. In the present section, 
Vyshnegradskiy’s study is presented in a simplified form. 

The centrifugal governor (Fig. 46) is a vertical spindle or rod S which 
can rotate about its vertical axis, at the upper end of which are attached 
on hinges two identical arms L, and Ly with identical weights at their 
ends. The arms L, and Ly are joined together by supplementary links so 
that they can only deviate from the vertical position simultaneously by 
the same angle y, and they lie in a common vertical plane which contains 
the spindle S. When the arms L, and Ly deviate from their vertical posi- 
tion by an angle ¢, they set in motion with the aid of the links a special 
sleeve M, which is fitted on the spindle S so that the distance from this 
sleeve to the upper end of S is proportional to cos ¢. The length of the 
vertical arms L, and Le will be taken as unity, and the mass of each of 
the weights fixed on their ends will be denoted by m. If the spindle S 
rotates with angular velocity @ and the arms L, and Lz are inclined from 
the vertical position by an angle ¢, then each of the weights is subject to 
the centrifugal force 

mé? sin 9. (1) 


Simultaneously, a gravitational force equal to 
mg (2) 


acts on each weight. Since the forces acting upon the weights in the direc- 
tion of the arm L, are balanced by the reaction of arm Z;, then, in order to 
calculate the force acting on the weights, it is necessary to resolve both of 
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—mg sin » ‘ 


Ficure 47 


these forces in the direction of the axes, the first of which is parallel to 
the arm and the second perpendicular to the arm, i.e., in the direction of 
increasing ¢. It is immediately seen (lig. 47) that the component of the 
force (1) in the direction of increasing ¢ is equal to 


m6? sin ¢ cos ¢, (3) 


and the component of the gravitational force (2) in the same direction is 
equal to 
—mg sin ¢. (4) 


Thus the resultant of the two forces (3) and (4) is given by the formula 
m6" sin g cos @ — mg sin ¢. (5) 


The simplified version of the performance of a centrifugal governor is 
that for a given angular velocity 6, the arms L, and Le, deviate by an 
angle ¢ under the action of forces (1) and (2); the angle ¢ may be de- 
termined from the equality 


mé? sin g cos y — mg sin ¢g = 0, (6) 


L.e., by equating the force (5) to zero. The relation (6) determines the 
angle ¢ as a single-valued monotonic increasing function of the velocity 6; 
in this sense, Watt’s governor can be considered as a measure of the velocity 
of rotation. This is the so-called static study of a governor. In reality, we 
have here a dynamic phenomenon. Under the action of the force (5), the 
mass m performs a motion described by a differential equation. In addition 
to the force (5), the mass m is acted upon during its motion by a frictional 
force in the hinge joints. This force depends in a rather complex manner 
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on the motion which takes place. Simplifying substantially the com- 
plexity here, we shall assume that the frictional force is proportional to 
the velocity ¢ of the motion of the mass m and that its sign is opposite 
to that of ¢, i.e., it has the value 


—b¢, 


where b is a constant. Thus, if we take ¢ as the coordinate which de- 
termines the position of the mass m, then we obtain for ¢ the differential 
equation 

me = m6 sin g cos ¢ — mg sin g — b¢. (7) 


[The calculation of the force (5) is carried out here under the assumption 
that 6 and ¢ are constants. For variable 6 and ¢, additional forces enter, 
which, however, are balanced by the reactions of the arms and of the 
hinges which force the arms to move in a plane. Thus equation (7) turns 
out to be valid.] 

The steam engine is represented by a flywheel with moment of inertia J, 
which is set into rotary motion by the force of the steam and is capable of 
performing useful work (for example, hoisting a cage from a mine). The 
differential equation of a steam engine can thus be written in the form 


Ja—=P,—P, (8) 


where w is the angular velocity of the rotation of the flywheel, P, is the 
moment due to the action of the steam, and P is the moment due to the 
weight of the cage acting on the flywheel. The moment P, depends on 
how much the valve is opened to admit steam into the cylinders of the 
steam engine, and the moment P depends on the load of the cage. 

The centrifugal governor is connected to a steam engine in order to 
maintain a uniformity of operation. It “measures” the speed of rotation 
of the flywheel and, if it is too great, decreases the supply of steam; if it 
is too small, it increases the supply of steam. To this end, the flywheel 
of the steam engine is connected by a set of gears with the spindle of the 
governor (Fig. 46) so that between the angular velocities w and 6, there 
exists a constant relation 

6 = nw, (9) 


where n is the so-called transmission ratio. This is the effect of the engine 
on the governor, as a result of which the flywheel speed is measured. On 
the other hand, the sleeve M of the governor is connected with the valve 
which admits the steam, so that 


P, = F, + k(cos g — cos ¢*), (10) 
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where ¢* is a certain “mean” value of ¢ near which the regulated value ¢ 
must be maintained, F, is the value of the force due to the action P, 
of the steam at »¢ = ¢*, andk > 0isa constant factor of proportionality. 

As is obvious from (10), the reverse action of the governor on the steam 
engine is realized in such a way that as the angle ¢ increases, the supply 
of steam (together with the force due to the action P, of the steam) de- 
creases. As a result of the interaction described between the engine and 
the governor, the latter, it would seem, completely accomplishes its task, 
increasing the steam supply when the flywheel speed decreases and de- 
creasing the steam supply when the flywheel speed increases. In this 
connection, it is natural to expect that the rotational speed of the flywheel 
will be stabilized. This was in fact observed in steam engines built before 
the middle of the nineteenth century. In order to explain the reasons for 
the breakdown in performance of the governor in steam engines, which 
began to be observed after the middle of the 19th century, it is necessary 
to make a detailed study of the dynamics of the performance of the engine- 
governor system and of its stability, which is what was done by Vyshne- 
gradskiy. 

As is evident from relations (7) to (10), the engine-governor system is 
described by the two differential equations 


me = mn*w’ sin y cos y — mg sin y — b¢, 
(11) 


Ja = keosy — F, 


where F = P — F, + k cos ¢* is a quantity which depends on the load. 
The first of these equations is of second order. To reduce this system to 
a normal form we shall introduce a new variable y by setting 


y= 4%, 
so that system (11) may be written in the normal form 
e=Y, 
; 22°: . b 
y= nw sin g cosy — gsing — ~ v, (12) 


. k F 
o= 7 cos g — 7 . 
The satisfactory performance of a steam engine requires that the angular 
speed w of the flywheel remain constant both for a fixed load P, i.e., for a 
constant F, and for a stationary steam supply valve. Because of the latter 
requirement the angle y remains unchanged. Thus it is a matter of seeking 
a solution of the system (12) of the form 


P= #0) y = 0, @ = Wo, 
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i.e., finding the state of equilibrium of this system. The problem is that of 
finding the state of equilibrium of system (12) and then studying its 
stability. 

By equating to zero the right-hand sides of (12) and solving the re- 
sulting equations, we find the coordinates of the equilibrium state 


Yo = 0, 
COS Yo = ‘ (13) 
n2u3 = —2 
COS go 


Let us set 
e = got Ag, ¥ = vo + Ay, @ = Wo + Aw. 


As a result of such substitution and linearization of equations (12), we 
obtain the system 


Ag = Ay, 


. b 
Ay = n> we cos 2¢9 Ag + n* Wo sin 2¢9 Aw — g COS gp Ag — m Ay, 


Kk. 
Aw = — F Sin go Ag. 


Substituting into the second of these equations the value of n7w% given 
in (13), we obtain, after a simple calculation, 


» 2 . 
g Sin” ¢o b 29 SiN Yo 
= — ————— Aw. 
Ay COS Yo Ae m Ay + Wo ° 


The characteristic polynomial of the linear system of equations obtained 
for Ag, Ay, Aw is 


—p 1 0 
- 2 . 
_ |_ gsin™ ¢o bb 2g SIN go 
D(p) = COS ¢o m ?P Wo , 
kt. 
— 7 sin 0 0 —p 


or, after calculating the determinant and multiplying it by —1, 


b sin? 2kg sin? 
—Dp) = w+ + at 


27] THE CENTRIFUGAL GOVERNOR 219 


All coefficients of this polynomial are positive, and therefore the necessary 
and sufficient condition for its stability is (by Theorem 6) that the in- 
equalities 

b gsin” go 2kg sin? vo 

—.f2 > 1 4 

M COS go Ja 


or 


by > 2k cos ¢o 2F 
m Wo ~ Wo 


(14) 


{see (13)] are satisfied. Relation (14) represents, by Lyapunov’s theorem 
(Theorem 19), the sufficient condition for the stability of the engine-governor 
system. 

In order to clarify the meaning of the right-hand side of the last in- 
equality, we shall introduce the concept of nonuniformity of performance 
of a steam engine, which plays an important role in engineering. From 
(13) it is evident that changing the value F = P — F, + k cos ¢* (ie., 
changing the load P) alters the stable speed wo. The quantity dwo/dP 
characterizes the rate of change of wo when the load P is changed. Its 
absolute value vy = |dwo/dP|(as we shall soon see, the derivative dw)/dP 
is negative) is called the nonuniformity of performance of the steam 
engine. By (13) we have 

Fw? = const, 


and therefore by differentiating, we obtain 


dw _ __ Wo 
dr 2F 
Thus 
v= pal , 
2F 


and the stability condition (14) may be rewritten in the: final form 


wy > 1. (15) 


From formula (15) Vyshnegradskiy made the following deductions: 

1. An increase of the mass m of the balls has a harmful effect on the 
stability. 

2. A decrease of the coefficient of friction 6 has a harmful effect on 
stability. 

3. A decrease of the moment of inertia J of the flywheel has a harmful 
effect on stability. 

4. A decrease of the nonuniformity v has a harmful effect on stability. 
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In order to make his conclusions accessible to engineers and to attract 
attention to the more important results, Vyshnegradskiy formulated his 
famous “theses” at the end of his work. 

First thesis. The cataract (friction) is an essential element of a sensitive 
and correctly operating governor, or briefly, “without a cataract, there 
can be no governor.” 

Second thesis. Astatic governors (i.e., governors with zero nonuniformity) 
should not be used even with the cataract, or briefly, “without nonuni- 
formity, there can be no governor.” 

The breakdown in the performance of governors in the middle of the 
19th century is explained by the fact that, due to the development of 
engineering, all four quantities appearing in (15) were subjected to changes 
which served to diminish the stability. Specifically, because of the in- 
creasing weight of the valve (in connection with the growth of engine 
power), heavier balls were being used. Improved machining of the surfaces 
of the engine parts led to a reduction in friction. The increase of the 
operating speed of the engines made it necessary to decrease the moment 
of inertia J of the flywheel. Finally, the tendency to decrease the de- 
pendence of speed on load led to the reduction in nonuniformity of per- 
formance. 

Having explained the unfavorable effects of all the factors indicated, 
Vyshnegradskiy recommended in his theses an artificial increase of friction 
(by means of a special device, the cataract) and an increase of the non- 
uniformity of performance (by changing the numbers n and k, which de- 
pend on the design of the engine). 


28. Limit cycles, In this section we shall define, and to a certain degree 
study, the concept of limit cycle which was introduced by the Irench 
mathematician Poincaré; we shall also give a criterion for establishing the 
existence of a limit cycle. It should be noted that at the present time the 
concept of limit cycle plays a most important role both in the theory of 
ordinary differential equations and in its applications in engineering. 

We shall investigate the normal autonomous system (see §15) of equa- 


tions . . 
p= fial...,2), i@=1,...,2, (1) 


whose right-hand sides are defined and have continuous partial derivatives 
of'/dx’ in a certain domain A of the phase space R of the variables x',... , 
™ We shall also use the vector form of this system 


x = f(x). (2) 


The most essential considerations of this section will be concerned with 
the case n = 2. In order to emphasize that this case is two-dimensional, 
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we shall speak about the phase plane P of system (1) and not about its 
phase space R. In the study of phase planes, intuitive geometrical con- 
structions will play an essential role. The case when the domain A coin- 
cides with the entire phase plane P is by no means trivial, and, for the sake 
of simplicity, our entire attention can be concentrated on it. 

The limit cycle and the behavior of trajectories in its vicinity. By a limit 
cycle of equation (2) (n = 2) we shall mean an isolated periodic solution 
of this equation. More precisely, let x = y(t) be a periodic solution of (2) 
and let K be the closed curve in the plane P described by this solution. 
The solution x = g(t) (and also the trajectory K) is assumed to be an 
zsolated periodic solution and is called a limit cycle if there exists a positive 
number p such that for any point £ in the plane P, whose distance from K 
is positive and smaller than p, the solution of (2) passing through the point 
£ is not periodic. 

This means geometrically that there can be no other closed trajectories 
of this equation in the vicinity of the closed trajectory K. The question 
of how the trajectories of equation (2) behave in the neighborhood of a 
limiting cycle K is answered by the following theorem. 


THEOREM 20. Let x = g(t) be a limit cycle of equation (2) (n = 2) 
and K a closed trajectory described by this solution in the plane P. The 
closed curve, as is known, separates the plane into two domains, an 
interior and an exterior; since the trajectories of (2) cannot intersect, each 
trajectory distinct from K is interior or exterior to the trajectory K. 
It is found that for exterior as well as for interior trajectories there are 
two mutually exclusive possibilities of behavior in the neighborhood of K. 
That is, all interior trajectories starting in a neighborhood of K spiral 
around K, either as t > +oo [Fig. 48(a)] or as £ > —oo [I’ig. 48(b)]. 
The same is true also for exterior trajectories [I*igs. 48(a) and (b)]. 

If all trajectories (both interior and exterior) start in the neighborhood 
of K and spiral toward K as t — +o, then the limit cycle is called 
stable (Fig. 48(a)]. If all trajectories starting in the neighborhood of K 
spiral away from K ast — — oo, then the limit cycle is called completely 
unstable [Fig. 48(b)]. In the two other cases (i.e., if the interior trajec- 
tories spiral away from K ast — —o and the exterior trajectories spiral 
toward K as ¢—-+oo) the limiting cycle K is called semistable 
(T'ig. 48(c)]. 


Proof. To the proof of Theorem 20 we preface a proposition. 

(A) Let x = g(t, —&) be a solution of equation (2) (n = 2) with initial 
values 0, £, and let L be a curve defined in the plane P by an equation in 
vector parametric form 


x= ¥(u), (3) 
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(hb) (c) 


Figure 48 


where the vector function ¥(u) of the parameter u has a continuous deriva- 
tive which is not zero. If the trajectory x = ¢(é, £)) and the curve L 
intersect at the point 


v(to, £o) = ¥(uo) = b (4) 


but are not tangent there, i.e., the vectors f(b) and y(uo) are linearly 
independent, then for sufficiently small | — £o|, the trajectory ¢(t, £) 
and the curve L intersect at the point 


o(t(é), £) = v(u(é)), 


where the values i(£) — tp and u(£) — wo are small and the functions 
i(£) and u(£) have continuous derivatives with respect to the components 
of the vector €. From this it follows, in particular, that every trajectory 
passing sufficiently close to b intersects the line L. 
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For the proof let us consider the vector equation 


e(t, £) — ¥(u) = 0 (5) 


determining the desired point of intersection, in which the variables t 
and u are unknown functions of the vector £. By hypothesis, equation (5) 
has the solution ¢ = to, u = uo at & = &p [see (4)]. In order to prove the 
existence and differentiability of the solution of (5) we write this equation 
in the scalar form 


e\(t, £) — ¥i(u) = 0, 
g(t, &) — v?(u) = 0. 


For ¢ = to, u = uo, & = £o, the functional determinant of this system 
is obviously 


1 2 
— Hb) fb) 

1 2} , 
_% _& v™(uo) -¥?"(ue) 


which does not vanish since by hypothesis the vectors f(b) and (ug) are 
linearly independent. Hence, by the well-known implicit function theorem, 
and also by Theorem 18 on the differentiability with respect to the initial 
values, the validity of proposition (A) follows. 

The proof of Theorem 20 is based on a consideration of the succession 
function of equation (2) in the neighborhood of the closed trajectory K. 
In the construction of this function we shall use only the fact that K is 
a closed trajectory, and not the fact that it is a limit cycle. The period of 
the trajectory K will be denoted by 7. Let L be a curve which is defined 
on the plane P by equation (8) and which intersects K at a unique point 
b = y(uo) and is not tangent to it at this point. For example, we can 
assume that Z is a straight-line segment with the interior point b. Let p 
be a variable point of the curve L. Since for p = b the trajectory ¢(, p) 
intersects L at the point b for ¢ = T’, it follows from proposition (A) that, 
for p close to b, the trajectory y(t, p) intersects Z at the point q for ¢ close 
to T and that the point q is the successor of the point p (Fig. 49). This 
means that on the segment pq of the trajectory y(t, p) there are no points 
of intersection with the curve L which are distinct from p and q (see 
Theorem 18). If 

p= vu), a= ¥(), 


then by proposition (A) the quantity » is a function of u, that is, vy = X(u), 
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K 


Figure 49 


where X’(u) is continuous. The function x(u) is called the succession 
function. It is obvious that 


X(ug) = to. (6) 


Furthermore, since the trajectory (é, p) cannot intersect K for p ¥ b, 
the points p and q lie on the same side of K, so that the numbers 


u— Uo, X(u) — up (7) 


have the same sign. 

We shall now show that the coordinate u of the point p is a single-valued 
differentiable function of the coordinate v of the point q, i.e., that the rela- 
tion v = X(u) has an inverse u = X~/(v). To do this we may consider, 
instead of equation (2), the equation 


x= —f{(x), (8) 


which differs from (2) only by the sign of the right-hand side. By equation 
(8), the point p is the successor of the point q, so that, by what we have 
already proved, u = X—'(v), where X~'(v) is the succession function for (8) 
in the neighborhood of its periodic trajectory K. 

Because the quantities (7) have the same sign, the derivative of the 
function X(u) at the point wo is nonnegative [see (6)], and since X(z) has 
a differentiable inverse, the equality X’(up) = 0 cannot occur. Thus the 
inequality X’(uo) > 0 holds, and therefore 


x’(u) > 0 


for all values of u sufficiently close to uo. Hence it follows that the suc- 
cession function is monotone increasing. 
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It is obvious that to every solution of the equation 
X(u) = u (9) 


corresponds a periodic solution ¢[t, ¥(u)] of equation (2), and since K is a 
limit cycle, there exists a positive number € such that on the interval 
lu — ug| < €, equation (9) has only one solution u = wo [see (6)]. We 
shall restrict our consideration of the function X(u) to that interval. 
The points ¥(u) foru < ugand u > uo lie on opposite sides of the curve 
K. To be specific, we shall assume that for u > wo they lie outside the 
curve K, and we shall study the behavior of exterior trajectories by means 
of the properties of the function X(u) in the interval uw < u < up +e. 
Since equation (9) has no solutions on this interval, one of the inequalities 


x(u) < 4, (10) 
x(u) > u (11) 


must hold on this entire interval. The first of these inequalities means that 
outside K the point q is closer to b than p on the curve L (Tig. 49), and the 
second inequality means the opposite. Inequality (11) is obviously equiva- 
lent to the inequality 

xl) <o. (12) 


We shall analyze the case of inequality (10). Let u, be an arbitrary point 
of the interval uw <u < wo + €, and let us set 


Ui4+1 => X(us), i= 1, 2, wee (13) 


By (10) the sequence 
Uy, U2,..., Ui... 


is monotone decreasing, each of its terms being larger than uo. Let u* 
be the limit of this sequence; we shall show that u* = uo. Passing to the 
limit in equation (13) as 7 — oo, we obtain x(u*) = u*, and since u = uo 
is a unique solution of equation (9), then u* = wo. The points ¥(u), 
W(uz),..., ¥(us), ... are successive points of intersection of the trajectory 
g(t, ¥(u1)] with the line Z, and because they converge to the point b for 
sufficiently small ¢, this trajectory spirals toward K as t— o (see 
Theorem 18). This is true for the trajectory which starts at any point 
¥(u,), Uo < uy < Up + €; thus, spiral trajectories of this form fill the 
entire exterior half-neighborhood of K. 

If (12) is satisfied for the succession function of equation (8), then by 
applying to equation (8) the assertion just proved, we conclude that the 
exterior half-neighborhood of the trajectory K is filled with spiral trajec- 


226 STABILITY [cHap. 5 


tories of equation (8) which spiral toward it as > +00. Since the tra- 
jectories of equations (2) and (8) coincide geometrically and proceed only 
in opposite directions, then from what was proved for equation (8) we may 
conclude in the case of equation (2) that whenever (11) is satisfied, the 
exterior half-neighborhood of K is filled with trajectories which spiral 
toward K as t— —o. The behavior of interior trajectories may be 
studied in exactly the same manner. Thus Theorem 20 is proved. 

Note. In order to combine in one formulation the relation between the 
behavior of the function X(u) in the neighborhood of ug and the behavior 
both of exterior and interior trajectories, we shall consider the inequalities 


|X(u) — uel < |u — wol, 44) 
Ix(u) — uol > |w — ul. 


If the first of these inequalities holds in a half-neighborhood of the curve K 
(exterior or interior), then the point q lies closer to b than to p on the line 
L, so that the trajectories spiral toward K in this half-neighborhood as 
t — ++oo. However, if in this half-neighborhood the second of inequalities 
(14) holds, then the trajectories spiral toward K in this half-neighborhood 
ast— —o. 

The geometrical study of the succession function X(u) has a certain at- 
tractiveness. We shall represent it in the form of a graph of the equation 


v = X(u) (15) 


in the wo-plane, assuming here for convenience that up > 0. In order to 
obtain a solution of equation (9), we shall consider, along with the curve 
(15), the bisector of the first quadrant 


v=U (16) 


(lig. 50). To find all solutions of equation (9) it is necessary to find all the 
points of intersection of the curves (15) and (16). In order that the closed 
curve K be a limit cycle, it is necessary and sufficient that the point (wo, uo) 
be an isolated point of intersection of the curves (15) and (16). If these 
curves are not tangent at the point (wo, uo), that is, if x’(uo) ~ 1, then 
their point of intersection (uo, uo) is necessarily isolated. In this case K 
is called a “rough” limit cycle. Whenever x’(wg) < 1 (Fig. 50) in both 
half-neighborhoods, the first of inequalities (14) is obviously fulfilled, so 
that the limit cycle K is stable. Whenever X’(uo) > 1 (Jig. 51), the second 
of the inequalities (14) is fulfilled so that the limit cycle K is completely 
unstable. 

If the curves (15) and (16) are tangent at the point (uo, uo) but the 
curve (15) passes from one side of the bisector (16) to the other, then the 
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Fiaure 50 


Ugb------- 


ug U 


uo 


Figure 51 Figure 52 


limit cycle K is either stable or completely unstable. However, if (15) is 
tangent to the bisector (16) and lies on one side of it (Iig. 52), then the 
corresponding limit cycle is semistable. 

Criterion for the existence of a limit cycle. 

(B) Let o(é) be a certain solution of equation (2) (n is arbitrary) defined 
for all values £ > to and remaining in a closed bounded set F of the domain 
A for these values of ¢. The point p of the space F is called an w-limit point 
of the solution g(¢) if there exists an unbounded increasing sequence of 
values (larger than fo) 


lim t& = 0, 
ke 


ty, fo). 1 thy ee ey 
such that 
lim ot.) = Pp. 
The set 2 of all w-limit points of the solution g(t) is called the w-limit set. 
Thus the set 2 is nonempty, closed, and bounded. The set 0 also consists 


of entire trajectories; that is, if the point & belongs to 2, then the solution 
y(t, £) with initial values (0, £) is defined for all values of ¢, and the entire 
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trajectory g(t, —) is contained in the set Q. It is obvious that the w-limit 
set of the trajectory g(t, £) is entirely contained in Q. 

We shall prove proposition (B). I’rom the fact that the set F is closed 
and bounded it follows that the set Q (which is obviously contained in F) 
is nonempty and bounded. We shall show that the set 2 is closed. Let 


Pi, P2,---, Pe,--- 
be a certain sequence of points of the set 2 which converges to some point 
p of the set F; we shall prove that p belongs to 2. Let €1, €,...,&,... 
and 81, $2,..., 8%, ... be two sequences of positive numbers such that 
lim €, = 0; lim s, = oo. 
ko kw 


Since the point p; belongs to Q, there exists a value t;, > s, such that the 


distance between p; and g(é,) is smaller than e,. For these values 


ty, ta,..-, be. --, lim t = » 


kw 
we obtain 
him o(ts) = Pp, 


which means that p is in ©. 

We shall now show that the set 2 consists of complete trajectories. Let 
£ be an arbitrary point of the set 2 and ¢(é, £) a solution with initial values 
(0, £). Moreover, let T be a value of ¢ (which can be negative) for which the 
solution g(¢, £) is defined, so that the point o(7’, &) exists. Since the point £ 
belongs to 2, there exists an unbounded increasing sequence 


ty, lo, ..-, bey. .-, lim tk = 0, 
such that 
iim v(t.) = &. (17) 


Since the solution g(¢) is defined for all sufficiently large values of t, then 
for a given T (starting from some k) the points 


e(t. + T) = oT, o(ts)) 
are defined [see §26, (C)]. From (17) and Thcorem 18 we have 


lim (te + T) = lim eT, o(ts)) = 9(T, 8), 


from which it follows that the point g(t, £) belongs to the set Q and con- 
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P(t) 
L 


Figure 53 


sequently to the set F. Thus the trajectory o(t, £) cannot leave the set F 
either for increasing ¢ or for decreasing t, and therefore by proposition (B) 
of §24 it is defined for all values of t. Thus proposition (B) is proved. 

Let us consider some special cases of the w-limit set. If the solution 
g(t) [see (B)] is an equilibrium state, i.e., if g(t) = Xo, then the wlimit set 
of the solution g(t) obviously consists of one point x9. If g(é) is a periodic 
solution describing a closed trajectory K, then the w-limit set of y(t) 
clearly coincides with K. Finally, if K is a periodic solution and g(t) is a 
trajectory which spirals toward this solution as { — ++oo, then K is the 
w-limit set of g(). 

We shall now prove a theorem which makes it possible to establish the 
existence of a periodic solution in certain cases. Whenever the right-hand 
sides of (1) are analytic, this periodic solution will be either a limit cycle or 
will be contained inside the family of periodic trajectories (see Example 3). 


THEOREM 21. Let g(t) be a solution of equation (2) (n = 2) defined for 
all values t > t) and remaining for these values of ¢ in a closed, bounded 
set F which is contained in A, and let Q be the w-limit set of the solution 
v(t). If the set 2 does not contain any equilibrium states, then it consists 
of one closed trajectory K. Here two cases are possible: (1) g(é) is a 
periodic solution and K the trajectory described by it, and (2) the 
trajectory described by the solution g(t) as t > +00 spirals toward the 
trajectory K. 


Proof. If g(t) is a periodic solution, then the set © consists of a unique 
periodic trajectory K described by the solution g(t), and the conclusion 
of the theorem is obvious [case (1)]. Let us assume that g(t) is not periodic 
and let b be an arbitrary point of the set 2. We shall draw a straight-line 
segment ZL through the point b so that it is not collinear with the phase 
velocity vector f(b) which starts from the point b [the vector f(b) can 
not equal zero since the point b of the set 2 is by hypothesis not a state of 
equilibrium]. We choose a segment so short that all trajectories passing 
through a point of this segment will intersect it (without being tangent) 
in the same direction as the trajectory passing through b (I’ig. 53). Since 
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9(t,—h) 


Figure 54 


b is the w-limit point for the trajectory g(t) and the latter is not closed, 
this trajectory obviously must intersect the segment Z an infinite number 
of times and at distinct points [see (A)]. Let a; = g(t,) and ag = (te) 
be two points of intersection of the trajectory g(t) with the segment L 
following each other in time (t; < ¢2). That segment of the trajectory 
g(t) with ty) < t < ty will be denoted by M. Together with the segment 
aja it forms a closed curve Q, which divides the plane into two domains 
G, and G2. Let h be a small positive number. Geometrically it is obvious 
(lig. 54) that the points o(f; ~ A) and g(t, +h) lie on different sides 
of the curve Q; we shall assume that the first belongs to the domain G; 
and the second to the domain Gp». All trajectories going from the domain 
G, into the domain G2 pass through the segment a,;a_. Thus, no trajectory 
can leave the domain G2 through this segment, nor can any trajectory 
enter or leave the domain G», through the curve M, because M is a piece 
of a trajectory and trajectories cannot intersect. Since a portion M of 
the trajectory g(t) intersects Z only at its endpoints, these endpoints of 
L lie on opposite sides of the curve Q. We denote by a that endpoint of 
L which is located in the domain Gz. The whole trajectory g(t), starting 
from ¢ > t2 + A, runs in the domain G2 and cannot intersect the segment 
a a9; therefore the point b does not belong to the segment aja [see (A)] 
so that it must lie on aja. Now if a3 = ¢(t3) is that point of intersection 
of the trajectory with segment L following a, (with respect to time), then 
it is evident from analogous considerations that it lies on the segment 
(Fig. 54). Denoting by 


ag = o(t4), ..., ae = olty),.-. 


the points of intersection following each other (in time) of the trajectory 
g(f) with Z, we can verify that they form on the segment ZL a monotonic 
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sequence of points directed from a; to b. We shall show that the limit b’ 


of the sequence aj, a2,..., a%, .. . coincides with b. 
In order to show this we shall first prove that the sequence 1), tg, ..., 
é, ... increases without bound. Let us assume that lim,_,. t% = 7 < +oo. 


Then g(r) = b’ and f(b’) = 9'(r) = lime se (v(7) — o(ts))/(r — ty), 
which is impossible since the vector g(r) — ¢(t,) is directed along the 
segment L, and the vector f(b’) is not collinear with this segment. Thus 
the relation lim,z_,o¢; = +o must be fulfilled, so that the whole trajectory 
g(t) intersects L only at the points a;,a.,...,a,,... fort > t;. Con- 
sequently, this trajectory has only one w-limit point b’ on the segment L 
{see (A)], so that b’ = b. We remark that only the fact that the point b 
itself is not an equilibrium state has been used in the proof. 

We shall now show that the trajectory ¢(¢) cannot enter the w-limit set 
for any other trajectory y(t). Let us assume the opposite. Then every 
point of the trajectory g(t) is an w-limit point for ¥(¢) {see (B)]; in particular, 
the point a, will be such a point. Since the point a; is not an equilibrium 
state, then, by what has been proved above, the successive points of inter- 
section 


bi, bo,..., by... 


of the trajectory y(t) with L form a monotonic sequence which converges 
to a,, and other w-limit points of trajectory y(t) on segment L do not exist. 
But this is contradicted by the fact that all points a2, a3,... located on 
the trajectory y(t) are w-limit points of the trajectory y(t). 

Thus we have proved that an open trajectory, among whose w-limit points 
there are no states of equilibrium, cannot itself be an w-limit trajectory. 

Since the trajectory K is contained in the w-limit set 2 of the trajectory 
y(t) and this set is closed [see (B)], all w-limit points of K are contained in 
Q and therefore are not states of equilibrium. Thus the proposition proved 
above can be applied to the trajectory K, so that K must be closed. From 
the entire construction it is evident that the trajectory y(t) spirals toward 
K, and therefore the set 2 consists only of the closed trajectory K which 
passes through the point b. 

Thus Theorem 21 is proved. 


EXAMPLES 


1. We shall give an example of a system of equations of the form (1) 
(n = 2) which has periodic solutions of a different type, in particular, 
limit cycles of different forms. Initially we shall define it in polar co- 
ordinates y and p, and then we shall transform it into rectangular co- 
ordinates x and y. Remembering the subsequent transformation into 
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rectangular coordinates, we shall define it in the form 
?=1; 6 = pg(p”), (18) 


where g(u) is a continuously differentiable function defined for all its non- 
negative values. In considering it in polar coordinates we shall use only 
positive values for p. 

The set of all positive values of p for which g(p”) = 0 will be denoted 
by N, and its complement in the set of positive numbers by D. To every 
number uo from N corresponds, obviously, the solution 


g=t, p = Uo 


of equation (18). The corresponding trajectory K,, is closed; it is a circle 
in the plane P? with center at the origin and with radius uo. Since N is 
closed in the set of all positive numbers, D is open and consists of a finite 
or countable number of intervals which do not intersect each other in pairs. 
Let uy < p < Uz be one of the finite intervals. Then the closed trajec- 
tories K,, and K,, bound an annulus Q in the plane P. For all numbers 
p of the interval u; < p < wg the function g(p”) does not change sign, so 
that on the entire interval one of the inequalities 


g(p’) < 0, — g(p?) > 0 (19) 
holds. Let 
g=t p= pli,u) (20) 


be a solution of (18) with initial values t = 0, ¢ = 0, and p = u, where 
uy <u < Ue. By what we proved in Example 1 of §15, the function 
p(t, u) is defined for all values of ¢; as t > oo it approaches one end of 
the interval uy < p < ws, and as ft ~ —oo it approaches the other end. 
Hence it follows that the trajectory (20) spirals toward the circumferences 
Ky, ast— « and K,,ast—» —o. That is, if the first of the inequalities 
(19) holds, then the trajectory (20) is a spiral which winds toward K,, as 
t— x and away from K,, ast > —o (Fig. 55). If the second of the in- 
equalities (19) holds, then the solution (20) is a spiral which winds away 
from K,, as — —s and toward K,, ast > +w (Fig. 56). Thus the 
annulus Q is filled by the same type of spirals of one of two forms depend- 
ing on which of the inequalities (19) is fulfilled on the interval u,; <p < ue. 
If the set N is bounded and wu* is its least upper bound, then on the infinite 
interval u* < p < +, the trajectory (20) spirals in one direction on 
Ky», and, in the other direction, it recedes to infinity. 

If the point uo of the set N is an isolated point, then the closed trajectory 
K,, is a limit cycle, whose form depends on the type of spiral filling the 
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Fieure 55 Figure 56 


annuli adjacent to the trajectory K,,. If the point wo of the set N is not 
an isolated point, then the periodic solution K,, is not a limit cycle. If, 
in addition, the entire interval with center uo is contained in N, then the 
periodic solution K,,, is contained in a whole family of periodic solutions 
forming a set of concentric circles with a common center at the origin. 
If the number wo forms the endpoint of an entire segment of numbers of 
the set N and is, at the same time, the endpoint of an interval of D from 
the other side, then the trajectory K,,, is extremal in the family of closed 
trajectories which are contiguous to K,,, from one side, while from the other 
side the family of trajectories spirals toward K,,,. It is possible, however, 
for the contiguity of closed trajectories to a periodic solution K,, to be 
much more complex. Such cases can be easily imagined; for example, N 
can be the perfect set of Cantor. 

We shall now write system (18) in rectangular coordinates by setting 


xX = pcos g, y = psing. (21) 
By differentiating (21), we obtain 


& = pcos y — posing = pg(p)-— — p-= = ag(z” + y”) — 9; 


(22) 


y = psing + pecosy = pg(p”)-=+p-— = yg(x? + y") +2. 


BIT Dia 
PIR oie 


Thus, system (18) may be written in the form of rectangular coordinates: 
t=ag(e?+y)—y gaye? +y’) +z. (23) 


(Here g can be, for example, an arbitrary polynomial.) System (23) has 
a state of equilibrium at the origin. 
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2. Let 
aha fi(e',a?,n), 2? = fP(e', 27, w) 


be a normal autonomous second-order system whose right-hand sides de- 
pend on the numerical parameter u and have continuous first-order partial 
derivatives with respect to 21, 22, u. In addition, let 


x = f(x, p) (24) 


be the vector form of this system. The solution of equation (24) with 
initial values 0, & will be denoted by g(t, &,u); let us assume that 
v(t, Eo, #o) is a periodic solution of equation (24) at u = uo with period T. 
We shall answer the question of what happens to this solution when the 
parameter yw varies in the neighborhood of yp. 

We shall represent the solutions of equation (24) in the same plane P 
independently of the value of the parameter u. Let K be a closed trajectory 
corresponding to the solution o(¢, £9, wo) and L a smooth curve defined in 
the plane P by means of the parametric vector equation 


x= ¥(u), 
which intersects the trajectory K at the unique point 
Eo = (0, £0, uo) = (T, £o, Ho) = ¥(Uo) (25) 
and is not tangent to it. We consider the vector equation 
v(t, ¥(u),u) — 0) = 9, (26) 


in which w and ware taken as independent variables and ¢ and v as unknown 
functions. Let u vary in the neighborhood of wo and yu in the neighborhood 
of uo. We shall seek solutions for ¢ close to 7 and for v close to uo. For 
u = Up and w = yo there is the trivial solution of equation (26), ¢ = T, 
v = Uo [see (25)], and the functional determinant corresponding to the 
system of equations for these values of the variables is distinct from zero, 
since the vectors f(£o, #o) and ¥’(uo) are independent. Toru = yuo equation 
(26) defines the succession function v = X(u, uo) of equation (24) with 
# = wo in the neighborhood of the closed trajectory K. For pu close to 
Mo, the function v = x(u, u) is also defined from equation (26) and can be 
considered the succession function of equation (24) in the neighborhood of 
the periodic solution K. However, equation (24) need not have a periodic 
solution for » ¥ po. To find the periodic solution of equation (24) for u 
close to uo, we consider the equation 


X(u,u) —u=O0 (27) 
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CE) 


(a) (b) (c) 
Ficure 58 


in the unknown function u(u) of uw. If the derivative of the left-hand side 
of (27) with respect to u is distinct from zero at u = uo, uw = Mo, that is, if 


te) 
au X(uo, Ho) # 1, (28) 


then (27) automatically has the differentiable solution u(u), so that for 
p close to wo equation (24) has a unique periodic solution, which is 
“smoothly” dependent on » and which reduces to K for u = po. Condi- 
tion (28) expresses the degree of “roughness” of the cycle K, and the result 
obtained justifies our use of the word “rough,” or “coarse.” A “rough” limit 
cycle does not “disappear” (and remains rough) for small variations of the 
right-hand sides of the system; it is stable under these variations. 
If the curve given by the equation 


v = X(u, u) (29) 
in the wo-plane is tangent for ~ = uo to the bisector 
v= ut (30) 


at the point (uo, wo) with the order of contact equal to unity [Fig. 57(b)], 
then for u = yo the curve (29) lies on one side of the bisector (30), and the 
limit cycle K is semistable [Iig. 58(b)]. As the parameter u varies in the 
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neighborhood of yo, the most natural behavior of the graph of (29) is for 
the point of intersection of (29) and (30) to disappear altogether [Fig. 57(a)] 
for values of u on one side of uo, and for two points of intersection of these 
curves [I‘ig. 57(b)] to appear for values of u on the other side, so that (24) 
has two rough limit cycles close to K [I*ig. 58(c)]. Thus, when pu passes 
through uo, we do not have a limiting cycle at first [I‘ig. 58(a)]; further, at 
t = Ho, one semistable cycle appears which, for subsequent variation of 
yu, decomposes into two coarse limit cycles which are close to K. The 
phenomenon described is called the “generation” of limit cycles of equa- 
tion (24) by variation of the right-hand side. 

3. We point out some important properties of a periodic solution K of 
(2) whenever the right-hand sides are analytic. Here we shall use without 
proof the fact that the solution g(é, £) of equation (2) in this case is an 
analytic function of ¢, ¢!, £?. In the construction of the succession function 
we shall assume that the curve L is defined by an analytic equation. Under 
these assumptions the succession function x(u) will be analytic, since it is 
the solution of an analytic equation. 

Since a periodic solution of equation (2) corresponds to the zeros of the 
function x(w) — u, then, in view of the analyticity of x(u), only two mu- 
tually exclusive cases are possible: (1) the periodic solution K is a limit 
cycle in the case when upg is an isolated zero of x(u) — u; (2) the periodic 
solution K is contained in the family of periodic solutions in the case when 
X(u) — u is identically zero. If any other trajectory spirals toward K, 
then K is not contained in the family of periodic solutions and consequently 
it is a limit cycle. Thus, when the right-hand sides are analytic in the 
second case, the periodic solution K of Theorem 2 is a limit cycle. 


29. The vacuum-tube oscillator. Here we shall describe systematically 
the design of the simplest vacuum-tube oscillator, a device which is a source 
of periodic (undamped) electrical oscillations. A qualitative mathematical 
theory of the operation of the generator will be given. The equation describ- 
ing the operation of a vacuum-tube oscillator is nonlinear, and its limit 
cycle corresponds to the periodic oscillations generated by the oscillator. 
One of the first studies of the adequacy of the mathematical concept of a 
limit cycle and of the physical concept of an undamped oscillation gen- 
erated by a vacuum-tube oscillator was that of the outstanding Soviet 
scientist A. A. Andronov. Before Andronov’s studies, attempts were 
made to explain the operation of a vacuum-tube oscillator by means of 
linear differential equations; such attempts could not give the correct 
mathematical picture of the gencrator’s performance. 

(A) A triode (one form of clectron tube) is represented by a threc- 
terminal element ass. The conventional representation of the triode is 
shown in Jig. 59. Here a is the plate, h is the cathode, and s is the grid. 
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a 


Ficure 59 Fic. 60. Characteristic of a triode 


Tq 


k 


Ficure 61 


Between poles s and k there exists a voltage difference U, (the grid voltage), 
but there is no current between the poles s and k; from pole a to pole k 
the current J, (plate current) flows through the tube. The law governing 
the operation of the triode may be expressed by the formula 


T, = f(U,). (1) 


The function f is called the characteristic of the triode. We shall assume 
that it is monotone increasing and positive, and satisfies the conditions 


lim f(Us) = 0, lim F(Us) = Ty, 


U;>—” U,7 +0 


where Jy is the saturation current of the triode (Fig. 60). Usually it is 
also assumed that the maximum of the function f’(U,) is attained at the 
point U, = 0. 

The three-terminal element described in (A) under the name triode in 
reality includes besides the electron tube also the plate battery, the battery 
of the grid bias, and the filament battery. 

(B) A vacuum-tube oscillator with an oscillatory loop in the plate circuit 
is diagrammed in lig. 61. It has four junction points a, k, s, b, and con- 
sists of a triode aks [see (A)] with the characteristic f(U;), a capacitor ak 
with capacitance C, resistance ab with value ?, inductance bk of magnitude 
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L, and an additional inductance sk of insignificant magnitude. Inductances 
kb and ks are connected by a negative mutual induction M(M > 0), 
which effects the so-called feedback coupling in the oscillator. If we denote 
by J the current which flows through the resistance ba, or, what is the 
same thing, through the inductance kb, so that 


J = Iba = Ip, 


hens it turns out that J, as a function of time #, satisfies the following 
differential equation: 


LWi+Ri+i= afd). (2) 


Let us derive equation (2). By Kirchhoff’s first law, we have 
Jt Tne = Ta, (3) 


where J;,, is the current flowing through the capacitor ka. In addition, 
by the properties of the triode we have 


Tun = 0. (4) 


Applying Kirchhoff’s second law to the oscillatory loop kbak we obtain 
[see (4)] 


Liv + RIva + 1 ft dt = 0. 


Differentiating this relation, we obtain 


* ° ] 
LI yy + Flea + @ Tok = 0. (5) 


By virtue of the mutual induction between inductances kb and ks we obtain 
[see (4) and also §13, (B)] ; 
U, = MI xy. (6) 


Thus equation (2) follows from (1), (3), (5), and (6). 
(C) In the phase plane of J and J equation (2) has a unique state of 
equilibrium with coordinates 


J=f0), J=0. (7) 


This state of equilibrium is asymptotically stable if 


R> 4 f"(0), (8) 


29] THE VACUUM-TUBE OSCILLATOR 239 


FicurE 62 


and completely unstable [see §26 (I')] if 
M 
R< Gf. (9) 


The point at infinity in the plane of J, J is completely unstable in all cases. 
This means that there exists a circle K in the plane J, J so large that any 
trajectory of (2) beginning at a certain instant enters into this circle and 
remains in it. When (9) is satisfied, the state of equilibrium (7) is also 
completely unstable. Thus by Theorem 21 (sec §28) the w-limit set of any 
trajectory which is not a state of equilibrium (7) is a closed trajectory. 
Thus, whenever (9) is satisfied, the vacuum-tube oscillator is a source of 
periodic undamped electric oscillations. 

Note: With the appropriate choice of the characteristic f, equation (2) 
has a unique limit cycle, toward which spiral all other trajectories of (2) 
which are not states of equilibrium of (7). One of the characteristics with 
this property will be illustrated by an example. 

For the proof of proposition (C) we shall replace the unknown function 
J by a new unknown gz by setting 


J = «+ f(0), (10) 


so that the origin in the (x, £)-plane corresponds to the point (7). 
By substituting (10) into (2), we obtain the equation 


t+ r t+ iz z= ie L(Mz) — #(0)]. (11) 


Let g(4) denote the function of ¢ on the right-hand side of this equation. 
It is seen immediately that g is bounded and monotone increasing, and 
vanishes only for the value zero of the argument (I'ig. 62). If, in additicn 
we set 


R 

zp ~ % 
La 
=o, 
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we can write (11) in the form 
E+ 2 64+ wx = g(H). 


Introducing a new variable y = <, we obtain from this equation a normal 
system 
i= y, 


; 5 (12) 
y = —w'r — 2 by + gly). 


To find the state of equilibrium of (12), we equate its right-hand sides to 
zero: 
y= 0, 
—w*x — 2 dy + gly) = 0. 
The system obtained has a unique solution 


x= 0, y = 0. 


Thus, the origin is a unique state of equilibrium of (12), and therefore it 
follows that the point (7) is the unique equilibrium state of equation (2). 
We shall now determine the stability conditions of the state of equilib- 
rium (0, 0) of (12) by linearizing this system at the point (0, 0). We obtain 
the system ; 
t= Yy; 


(13) 
y= —w'x — 2 by + g'(O)y. 


A simple calculation gives the characteristic polynomial 
? + (25 — g/(0))A + w? 


of the linear system (13). In the new notation the conditions (8) and (9) 
take the form 25 > g’(0), 26 < g’(0), respectively. Thus, whenever (8) is 
fulfilled, the equilibrium state (0, 0) is asymptotically stable [see Theorem 
19 and §9(B)], and whenever (9) is fulfilled, it is completely unstable [see 
§26(1°)]. 

To determine the asymptotic behavior of the trajectories of the system 
(12) in the phase plane of x, y, we consider the linear system 


i=y 
(14) 
y = —w*x — 2 by, 


which is obtained from (12) by discarding the term g(y) which is bounded 
in the entire plane. A simple calculation yields the characteristic poly- 


nomial of system (14), 
y a4) N? +2 6A 4+ w?, (15) 


29] THE VACUUM-TUBE OSCILLATOR 241 


and, since the numbers 26 and w” are positive, its roots have negative real 
parts. Thus, by proposition (E) of §26 for the linear system (14), there 
exists a Lyapunov function W(z, y) which satisfies the condition 


Walt, y) < —BW(a2, y). (16) 


Let us now calculate the derivative Waa(z, y) of the function W(x, y) 
with respect to the system (12): 


: ow 
Wal, ) = Wale, ) + EY gy), a7) 
Since the function g(y) is bounded, the inequality 
Med guy) <viaa (18) 


holds [see formula (14), §26], where Y is some positive constant. 
If we now set 
2Y 


c=7? a>7) 


B 4 
we obtain from (16), (17), and (18) the inequality 
W12)(2, y) < —2eW(z, y), whenever W(x, y) > ce? (19) 


The equation 
W(x, y) = ¢? (20) 


defines an ellipse in the xy-plane. It follows directly from inequality (19) 
that at the point (x, y) of the ellipse (20) the function W(z, y) decreases 
along the trajectory of system (12) which passes through the point (c, y). 
Thus whenever the trajectories of system (12) intersect the ellipse (20), 
they must enter it. If 


c=), y= vb (21) 


is a solution of system (12) which starts at a point (&, 7) outside the 
ellipse (20), then, by setting 


w(t) = Welt), ¥(9), 
we obtain for the function w(é) the inequality 


w(t) S —2ow(t), (22) 


which is valid whenever 
w(t) > e?. 
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If we integrate (22), we obtain 


Welt), VO) < WE, ne", 


From this it follows that the trajectory (21) must enter the ellipse (20). 
No trajectory leaves this ellipse, since at its boundary points trajectories 
can only enter. 

Let K be some circle in the zy-plane containing the ellipse (20). It 
follows from what has been proved that any trajectory of (12) which is 
distinct from the state of equilibrium (0, 0) necessarily enters the circle 
K and remains inside. Since the point (0,0) is completely unstable, it 
cannot be among the w-limit points of this trajectory, so that by Theorem 
21 (see §28) it either spirals toward a periodic solution or is a periodic solu- 
tion. Thus proposition (C) is proved. 


EXAMPLE 


A. A. Andronov, who first derived the nonlinear equation (2) for the 
oscillator, considered the case when the characteristic f of the triode was 
of a particularly simple form, that is, equal to zero for negative values of 
the argument and equal to a positive constant b for positive values of the 
argument. Assuming that f(0) = b/2 and making the change of variables 
(10), we arrive at the system (12) in which the function g(y) is defined by 


—w’a, y <0, 
gy) = 5 (23) 
wa, y > 0, 


where a = b/2. The system (12) with the discontinuous function g(y) 
may be written for the upper half-plane (y > 0) in the form 


f= y, 
(24) 
y = —w*r — 2 by + wa, 
and for the lower half-plane (y < 0) it may be written in the form 
Z=y 
(25) 
y = —w*x — 2 by — wa. 


We shall assume that the roots of the polynomial (15) are complex. Thus 
the state of equilibrium (0,0) of (14) is a stable focus [see §16, (C)]. 
However, the systems (24) and (25) differ from system (14) only by a 
translation; their states of equilibrium are located not at the origin as in 
the system (14), but at the point (a,0) of the system (24) and at the 
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point (—a, 0) of the system (25). We remark that the spirals of the linear 
system (14) wind around the state of equilibrium (0, 0) 1n @ clockwise direc- 
tion, and we note further that in the circuit of a half-turn of the spiral the 
phase point approaches the origin, so that its original distance from the 
origin is multiplied by a certain number } < 1 which does not depend on 
the initial position of the point [see §16, (C)]. 

In order to visualize the phase plane of system (12) in the case when the 
function g(y) is defined by (23), it is necessary to fill the upper half-plane 
with half-turns of spiral trajectories of system (24) and the lower half-plane 
with half-turns of spiral trajectories of system (25). However, in crossing 
the line y = 0, it is necessary to pass continuously from certain trajec- 
tories to others. On the basis of this description of the phase picture of 
system (12) [sce (23)], we shall look for its closed trajectories. 

Let us consider the trajectory of system (12) [see (23)] which starts on 
the axis of abscissas at the point & > 0. Since the motion in the phase 
plane of (12) is clockwise, the trajectory will go from the point chosen into 
the lower half-plane and consequently will be governed by system (25). 
After one half-turn of the spiral in the lower half-plane, the phase point 
again crosses the axis of abscissas at a point with the coordinate 


—(a+ a+ 8). (26) 


This follows from the fact that after one half-turn of the spiral, the distance 
of the phase point from the state of equilibrium (—a, 0) is multiplied by 4. 
The point with coordinate (26) lying on the axis of abscissas will then 
move by virtue of system (24) and, after one half-turn of the spiral in the 
upper half-plane, it will arrive at the axis of abscissas at a point with the 
coordinate 


a+ r(2a+r(a+ £)). (27) 


Thus a trajectory starting at a point with coordinate — > 0 on the positive 
part of the axis of abscissas will, after a complete turn, again contact the 
positive part of the axis of abscissas, but this time at the point with co- 
ordinate (27), and we obtain a mapping xX of the positive semiaxis of ab- 
scissas into itself, defined by the relation 


x(£) = a+ 2a + d2a + D7E. 


The function x(£) is the succession function for the system (12) [see (23)]. 
There is only one value of £ which satisfies the condition 


x(t) = &, 


and to this value of £ corresponds the limit cycle of system (12), which is 
both rough and stable, since x’(f) = A” < 1 (sec §28). 
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30. The states of equilibrium of a second-order autonomous system. 
In this section we shall classify and study nondegenerate states of equilib- 
rium of the normal autonomous second-order system 


& = f(x, y), 


(1) 
y = g(x,y), 


where it will be assumed that the right-hand sides are twice continuously 
differentiable, and, for the purposes of Theorem 23, that they are three 
times continuously differentiable. 

Nondegenerate states of equilibrium. Since the state of equilibrium can 
always be taken as the origin, we shall assume in what follows that the 
state of equilibrium of system (1) is the origin. Lincarizing system (1) 
at the point (0, 0), that is, expanding the right-hand sides of system (1) 
into Taylor scries in x and y and discarding second-order terms, we obtain 


the linear system 1 : 
z= ax + agy, 
(2) 


y= az + aay. 


Let d and u be eigenvalues of the matrix (ai). The state of equilibrium (0, 0) 
of system (1) is called nondegenerate if the numbers ) and pu are not equal 
and if their real parts are not zero. The behavior of trajectories of the linear 
system (2) was studied in detail in §16. Here it will be shown that for 
the nondegenerate state of equilibrium the behavior of trajectories in the 
neighborhood of the state of equilibrium (0, 0) of system (1) essentially 
coincides with the behavior of trajectories in the neighborhood of the 
state of equilibrium (0, 0) of system (2). 

We shall retain the notation of §16 for the state of equilibrium (0, 0) 
of (1). If the numbers A and yu are both real and negative, then the state of 
equilibrium is called a stable node. If the numbers \ and p» are both real 
and positive, then the state of equilibrium is called an unstable node. If 
\ and uw are complex conjugate and have negative real parts, then the state 
of equilibrium is called a stable focus. If \ and mw are complex conjugate 
and have positive real parts, then the state of equilibrium is called an 
unstable focus. Iinally, if \ and uw are real and are of opposite sign, then 
the state of equilibrium is called a saddle. 

The simplest properties of the behavior of trajectories in the neighbor- 
hood of the state of equilibrium can be established directly on the basis of 
the Lyapunov theorem (Theorem 19) and proposition (I°) of §26. Thus 
we obtain the following proposition. 

(A) A stable node and a stable focus are asymptotically stable states of 
equilibrium. An unstable node and an unstable focus are completcly 
unstable states of equilibrium. 
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This proposition already solves to a considerable degree the problem of 
the behavior of trajectories in the neighborhood of a node or of a focus. 
Actually, if it is known that a given state of equilibrium is asymptotically 
stable, then from the point of view of applications it often does not mat- 
ter how the trajectories approach it. The same also applies to the com- 
pletely unstable state of equilibrium. An entirely diffcrent role is played 
by the saddle: if we know the behavior of trajectories in its neighborhood, 
we can describe the behavior of the trajectories in the entire plane. At the 
same time, the theorem on the behavior of a trajectory in the neighborhood 
of a saddle is considerably more difficult to describe than the corresponding 
theorems concerning the node and the focus. 

Let us perform a lincar transformation of the coordinates in the phase 
plane of system (1) in order to give (1) its simplest form. 

(B) Expanding the right-hand sides of (1) in Taylor series in x and y 
up to second-order terms, we obtain 


= aj + ay + 7(z, y), 3) 


I 


y = ajc + agy + s(x, y), 
where the remainders r(x, y) and s(x, y) vanish together with their first 
derivatives with respect to x and y at the point x = 0, y = 0 and can be 
written in the form 
r(x, y) = rye? + 2ryory + rooy’, (4) 
s(x, y) = 81127 + Qsyory + sooy”, 


where the coefficients r;; and s,; of these “quadratic forms” are functions of 
x and y which are bounded in the neighborhood of the origin. It is found 
that by performing a real linear transformation of z and y into £, 7, the 
system (3) can be reduced to a simple form in which two cases are to be 
distinguished: (1) If the eigenvalues \ and u of the matrix (a}) are real 
and distinct, then the system of equations for & and 7» may be written in 


the form ; 
£= E+ p(E,n), 97 = wn +oa(£, 7). (5) 


(2) If the eigenvalues of (a}) are complex conjugate, i.c., if they have the 
form » + wz and uw — 1, then the system of equations for & and 7 may be 
written in the form 


E=pi—vnt+ (in), 2 = vetunt+o(é,n). (6) 


In both cases the remainders p(£, 7) and o(£, 7) have the same properties 
which were mentioned above for the functions r(x, y) and s(x, y). In the 
first case the system takes the form (5) if the directions of the eigenvectors 
of the matrix (a;) are taken as axes. 


246 STABILITY [cHapP. 5 


To prove proposition (B) it is sufficient to find a linear transformation of 
the coordinates z and y into the coordinates £ and » such that (2) assumes 
a simple form. Such a transformation has already been found [see §14, (F)]. 
By applying the same transformation to system (3), we obtain system 
(5) or system (6). 

Behavior of trajectories in the neighborhood of a saddle. 


THEOREM 22. Let us assume that the state of equilibrium O = (0, 0) 
of system (1) is a saddle. Let P be a straight line passing through the 
point O in the direction of an eigenvector of the matrix (ai) with a 
negative eigenvalue, and let Q be a straight line passing through the 
point O in the direction of an eigenvector of the matrix (a) with a positive 
eigenvalue. Then (Tig. 63) there exist exactly two trajectories U; and 
U2 of system (1) which tend asymptotically to the point O as { > +o. 
These trajectories, together with the point O, form a continuous differ- 
entiable curve U which is tangent to the straight line P at the point O. 
In the same way there exist exactly two trajectories V; and V2 of 
system (1) which tend asymptotically to the point O as t > —o; 
these trajectories, together with the point O, form a continuous differ- 
entiable curve V which is tangent to the straight line Q at the point O. 
The remaining trajectories of system (1) passing through the neighbor- 
hood of the point O behave, in general, in the same way as in the case 
of a linear equation (see §16). 

The trajectories U,; and U2 are called stable branches of the saddle O, 
and the trajectories V; and V2 are called unstable branches of the saddle O. 


Proof. First of all we shall take the line P as the axis of abscissas and 
the line Q as the axis of ordinates; then system (1) may be written in the 
form (5). Adopting again the notation x and y in place of £ and 7, we ob- 
tain the system of equations 


& = f(x,y) = 4x + rz, y), 
y = g(x,y) = wy + s(z, y), 


(7) 


where r(z, y) and s(x, y) have the form (4); here’ < 0,4 > 0. We shall 
note for what follows that in the proof below we shall use only the follow- 
ing properties of the right-hand sides of (7): they are continuously differ- 
entiable with respect to x and y and their functions r;; and s,; [see (4)] are 
bounded in the neighborhood of the origin. 

The proof will be divided into two principal parts: (a) the proof of the 
existence of the branch U; which approaches the point O along the positive 
part of the axis of abscissas when the coordinate x decreases; and (b) the 
proof of its uniqueness. The existence and uniqueness of the branch U2 is 
proved analogously. To examine the branches V; and Vz it is sufficient 
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to change the sign of the time ¢; in this connection, stable branches will go 
over into unstable branches and conversely. 
Let us turn to the proof of the existence of the branch U,. For this we 
shall set 
w(x, y) =y— ax”, a> 0, 


and we shall consider the parabola in the xy-plane defined by the equation 
w(x, y) = 0. (8) 


The parabola (8) divides the plane into two parts: a positive part, contain- 
ing the positive semiaxis of ordinates, and a negative part. The positive 
domain will be the interior of the parabola. First we shall show that if a 
is a sufficiently large positive number and z is sufficiently small (|x| < 6), 
then all trajectories of system (7) (except the state of equilibrium O) which 
intersect that part of the parabola (8) for which |x| < e€ pass from the 
negative side into the positive, i.e., from the outside to the inside of the 
parabola (lig. 64). J*or this we shall calculate the derivative a7)(x, y) 
of the function w(x, y). System (7) applied to the points of the parabola 
(8) yields 


q(x, at?) = ¥ — 2art = a(u — 2A)x? + sya? +--- 


(here the unwritten terms contain x to at least the third power). The 
number uw — 2) is positive and the function s,; is bounded in the neigh- 
borhood of the origin; therefore a number a can be chosen so large that 


a(u _— 2X) _ |s11| > 4, 6 > 0. 


The omitted terms in the expression for «,7)(v, av”) are at least third- 
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order terms in x and therefore a positive € exists such that for |z] < e 


we have 9 
a7) (2, aN ) 2 0, 


where the equality sign holds only for x = 0, that is, at the point O. 
T'rom what has been proved it follows that all trajectories of system (1) 
except the state of equilibrium O intersect that section of the parabola (8) 
in the direction of increase of the function w(x, y), that is, from the outside 
inwards. 

In exactly the same way, it can be proved that the section |x| < € of 


the parabola 
y+ ax? = 0 (9) 


is intersected by all trajectories of the system (7), except the state of 
equilibrium O, from the outside inwards [the interior section of the parab- 
ola (9) contains the negative semiaxis of ordinates (Fig. 65)]. 

Let a and b be the points at which the straight line + = e€ intersects the 
parabolas (8) and (9), respectively. We consider the triangle [0, a, }] 
formed from two sections of the parabolas (8) and (9) and the line segment 
(a, b]. If € is sufficiently small, then all trajectories of (1) passing through 
the triangle [O, a, b] go from right to left (J*ig. 66); in particular, they inter- 
sect. the segment [a, b] from right to left in entering the triangle [0, a, b]. 
This follows from the fact that the expression 


£=drdAr4+ r(x, y) 


[see (7)] is negative for 0 < x < ¢, [y| < az?, since \ < 0, and r(z, y) 


is a “quadratic form” in + and y with bounded coefficients. 
Let g(t, p) be a trajectory of system (7) starting at a certain point p 
of the interval (a, b) at £ = 0. This trajectory enters the triangle [O, a, }] 
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through the side [a, b]. With an increase in é, it can cither go out of the 
triangle through the parabolic arcs Oa, Ob, or not go out of the triangle at 
all. In the latter case the trajectory approaches the point O asymptotically 
asi— o. It is geometrically obvious that if the trajectory y(t, p) leaves 
the triangle through the arc Oa, then the trajectory g(t, p’), where p’ is a 
point of the interval (a, p), also leaves the triangle through the arc Oa 
(lig. 66). Furthermore, if the trajectory g(é, p) leaves the triangle through 
the arc Oa, then by the integral continuity theorem (Theorem 16) the tra- 
jectory v(t, p’’), where p” is a point sufficiently close to p’, also leaves 
through the arc Oa. Thus the set of all such points p of the interval (a, b), 
for which the trajectory g(t, p) leaves the triangle through the arc Oa, 
forms a certain interval (a, a’). (This interval is nonempty, that is, a’ ~ a, 
since trajectories starting in points p which are sufficiently close to a 
obviously intersect the arc Oa.) In exactly the same way, the sct of all 
such points p for which the trajectory g(t, p) leaves the triangle through 
the side Ob forms an interval (6, 6’). The intervals (a, a’) and (b, b’) 
cannot intersect, so that the point a’ is located above the point 0’, or in 
the extreme case, coincides with it. (Actually, they coincide, but this 
requires a comparatively complicated proof.) Thus the segment [a’, b’] 
contains at least one point, so that there exists a trajectory y(t, po), 
starting on the segment [a’, b’] and approaching asymptotically the point O. 

The tangent to the trajectory g(t, po) at the point (x, y) has the slope 


_— wy tsz,y). 
K(x, y) = hz + r(a, y) 


Since the point (a, y) of the trajectory y(t, po) belongs to the triangle 
[0, a, 6], 
lyl < ax”, O<4<e, (10) 


whence it follows that the number k(z, y) remains finite and tends to zcro 
asx — 0. On the other hand, the slope [(x, y) of the secant drawn from the 
point O to the point (a, y) of the trajectory g(t, po) is equal to y/x, and 
since inequalitics (10) are valid, we have I(x, y) —- 0 asx — 0. Thus the 
curve g(t, po), based on the point O, has at O a continuous derivative and 
is tangent to the axis of abscissas. The trajectory ¢(t, po) represents 
the branch U,. The branch Us, which approaches the point O along the 
negative part of the axis of abscissas, is also tangent to the axis of ab- 
scissas at the point O. Both these branches together form a curve U with 
the equation 

y = u(x), (11) 


where u(x) is a continuous and continuously differentiable function of z, 
with u’(0) = 0. 
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Thus we have proved the existence of stable branches U; and Ug, 
which together with the point O form the curve U defined by equation (11). 
We shall now prove the uniqueness of these branches. For this we shall 
transform the coordinate system in the neighborhood of the origin of the 
xy-plane so that the curve (11) becomes the axis of abscissas. To do this, 
we substitute a new unknown function z in place of y by the formula 


y = u(x) +2. (12) 
If we substitute (12) into (7), we obtain the new system of equations 


f(x, u(x) + 2) = F(z, 2), 
2 = g(x, u(x) + 2) — u'(a)f(z, u(x) + 2) = (2, 2), 


£ 


(13) 


where the unknown functions are x and z. Since u(x) has a continuous 
derivative, the function F(z, z) has continuous derivatives with respect 
to the two variables x and z, and G(z, z) is continuous in x and has a 
continuous derivative with respect to z. However, the existence of a con- 
tinuous derivative of the function G(x, z) with respect to x has not been 
established. Thus it has not been established that our usual hypothesis 
concerning the continuous differentiability of the right-hand sides with 
respect to all the unknown variables holds for system (13). It is obvious, 
however, that to every solution of system (13) corresponds by (12) a solu- 
tion of (7), and conversely. Thus the behavior of the trajectories of (7) can 
be determined by the behavior of the trajectories of (18). 

The stable branches U, and U2 of system (7) were transformed into 
segments of the axis of abscissas of the xz-plane, so that (13) has solutions 
in which the function x varies monotonically in a certain manner and 
approaches zero asymptotically, and the function z is identically zero. 
From this it follows that 

G(x, 0) = 0. 


Below we shall show [see (C)] that the function G(x, z) can be written in 


the form 
G(x, z) = 2H(z, 2), (14) 


where H(z, 2) is a continuous function of x and z. It follows from (14) and 
the continuity of the function H(a, z) that 


aG(a, z) 


e = lim G(0, z) — GO, 0) _ lim G(0, 2) 


0 z—30 z—0 z—0 @ 


= lim H(0,z) = H(0, 0). 
z—0 
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But by (7) and (13) we have dG(s, z)/dz|,-5 = uy, so that 
z=0 


A(O, 0) = up. 
Thus the second equation of (13) has the form 
z= eH(z, 2), 


where H(z, z) is close to w in the neighborhood of the origin and is therefore 
positive. Hence, in the neighborhood of the origin it follows that, along 
every trajectory which is distinct from the branches U, and Ug, the coordi- 
nate z does not change sign and its absolute value increases as é increases. 
Thus, no trajectory running outside the axis of abscissas of the xz-plane 
can approach the point O asymptotically, and the uniqueness of the 
stable branches U; and U¢ is proved. 

It has been proved that on the interval (a, b) there exists only one point 
po such that a trajectory of (7) starting from pp approaches the point O 
asymptotically as { > oo, forming the branch U,. If the point p lies in 
the interval (a, po), then the trajectory starting from p intersects the arc 
Oa. If p lies in the interval (6, po), then the trajectory starting from p 
intersects the arc Ob. 

From the parabolas 

xz — ay? = 0, (15) 
z+ ay? =0 (16) 
and the straight line 
= € 


(Fig. 67), we can form a triangle [O, c, d] with properties similar to those of 
the triangle [O, a, b]. There exists only one point qo in the interval (c, d) 
such that the trajectory from go approaches asymptotically the point O 
for decreasing ¢ and forms the unstable branch V,. If the point g is located 
on the interval (c, go), then the trajectory from gq intersects the are Oc 
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for decreasing ¢. If g is located on the interval (qo, d), the trajectory from q¢ 
intersects the arc Od for decreasing 1. 
Let us now consider the curve 


F(z, y) = 0 (17) 


(see (7)]. It is easy to see that it is tangent to the axis of ordinates at the 
point O. Since the function f(z, y) has continuous second derivatives, so 
that the curve (17) has a definite radius of curvature at the point O, 
we can find a number a so large and a number € so small that on the 
interval |y| < € the curve (17) passes between the parabolas (15) and 
(16) (lig. 68). To the right of the curve (17) the function f(z, y) is negative, 
so that the phase velocity vectors at points located to the right of the 
curve (17) are directed to the left. Let us draw a vertical segment [e, e] 
from the point c, the lower end e of which is located on the branch U,. 
Let p be a point of the interval (a, po). If the point 7p is sufficiently close to 
po, then by the integral continuity theorem (see §25, Theorem 16) the 
point, after leaving p, will pass sufficiently close to the origin of coordinates 
to intersect the segment [c, e]. As it moves farther, it must necessarily 
intersect the arc Oc. Indeed, if the moving point intersects the line (17), 
then it must intersect the arc Oc before this. If the moving point does not 
intersect the line (17), then it will always move to the left, and the vertical 
distance z from this point to the line (11) increases; in this case, therefore, 
the trajectory intersects the arc Oc. Thus, the trajectory enters the tri- 
angle [O, c, d]. After this, the trajectory will have to intersect the interval 
(c, Yo) at some point g. Conversely, if from the point q’ of the interval 
(c, go) we follow the trajectory in the direction of decreasing f, then for 
sufficiently close points q and qo this trajectory intersects the interval 


30] EQUILIBRIUM OF AN AUTONOMOUS SYSTEM 253 


FicurE 69 


(a, po) at some point p’ since it passes close to the origin (Tig. 69). Com- 
bining these two facts, it is easy to conclude that as p — po we have g > qo. 
This gives a complete qualitative idea concerning the behavior of tra- 
jectories in the neighborhood of a saddle. Thus Theorem 22 is proved. 
We shall now prove property (14) of the function G(a, z). 
(C) Let G(a, z) be a continuous function defined in the neighborhood of 
x = z= Oand having a continuous derivative (0/dz)G(a, z). If 


G(z, 0) = 0, 


then 
G(x, z) = zH(z,z), 


where A(x, z) is continuous. 
To prove (C) we define the function H(z, z) by setting 


H(a, 2) = G2, 2) for z2#0, 
° (18) 


H(az,2) = 3 G(x, 2) for z= 0, 


and we show that this function is continuous. For z ¥ 0 the function 
defined by (18) is obviously continuous. We shall prove that it is con- 
tinuous at the point (a9, 0). We have 


G(x, 2) = G(x, 2) — G(x, 0) = ze G(x, 62), 


where 0 < @ < 1. Since the function (0/dz)G(a, z) is continuous, we 
have G(a, z)/z = (0/dz)G(x, 6z) > (8/0z)G(xo, 0) for + — x0, 2 > O(z ¥ 0). 
Thus proposition (C) is proved. 

Behavior of trajectories in the neighborhood of a node and of a focus. The 
study of the node and of the focus is considerably simpler than that of 
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the saddle. Here it is sufficient to consider only the case of stability, 
since unstable nodes and foci are obtained from stable ones by changing 
the direction of the time flow. The principal method in the study of the 
node and the focus is the introduction of polar coordinates. 


Theorem 23. Let O = (0,0) be a stable node of system (1) with eigen- 
values \ and yw, where u < X < 0. We draw through O a straight line P 
in the direction of an eigenvector with eigenvalue \ and a straight line Q 
in the direction of an eigenvector with eigenvalue u. Then every tra- 
jectory starting sufficiently close to the point O approaches O asymptoti- 
cally and has a tangent at the point O. Here only two trajectories are 
tangent to Q, and, in approaching the point O from the opposite side, all 
the remaining trajectories are tangent to P. When the node 
(0 <A < yp) is unstable, the behavior of the trajectory as t > — 020 is 
similar. 


We shall assume the existence of the third derivatives of the right-hand 
sides of (7). By proposition (B), system (1) can be written in the form (5); 
again denoting the variables § and » by x and y, we obtain the system 


t= f(z, y) =r + r(x, y), 
y = g(x,y) = wy + s(x, y). 


(19) 


Here the functions r(z, y) and s(x, y) are three times continuously differ- 
entiable and vanish at the point O together with their first derivatives with 
respect to x and y. 

Let us now introduce polar coordinates, i.e., we assume that 


x= pcosg, y = psing. (20) 
Differentiating (20) and substituting the result into (19), we obtain 
pcos ¢ — pe sin g = Apcos¢ + r(pcos¢,p sin ¢), 
psin ¢ + pe cos v = up sin g + s(p cos g, p sin 9). 
Solving the relations obtained for 6 and ¢, we obtain 
6 = p(dcos* ¢ + usin? v) + F(p, @), en 
pe = (uw — A)psin ¢cos gy + G(p, ¢), 
where the functions 
F(p, ¢) = cos ¢- r(p cos ¢, p sin g) + sin ¢- 8(p Cos g, psin ¢), 


G(p, ¢) = —sin ¢-7(p cos ¢, psin ¢) + cos ¢- s(p cos ¢, p sin ¢) 
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are periodic in ¢ with period 27, three times continuously differentiable 
with respect to p and ¢, and vanish at p = 0 together with their first 
partial derivatives with respect to p, giving 


aF(0, ¢) _ 9G(0, ¢) _ 


FO, ¢) = G0, ¢) = ap Bp 


0. (22) 
By proposition (D) cited below, the function G(p, ¢) can be written in 
the form 


G(p, ¢) = pH(p, ¢), 


where H(p, v) is a function which is twice continuously differentiable with 
respect to p and ¢ and which vanishes at p = 0 for arbitrary ¢ [see (32) 
and (22)], giving 


H(0, ¢) = 0, (23) 
so that 

dH(0, ¢) _ 

aC 0. (24) 


Dividing the second of relations (21) by p we obtain the system 


6 = p(drcos* » + usin? ¢) + F(p, ¢), . 
(25) 


¢ = (uw — d)sin gcos¢ + H(p, ¢). 

We shall consider the system (25) on the phase plane of the variables p 
and ¢, plotting ¢ on the axis of abscissas and p on the axis of ordinates. 
The systems (19) and (25) are by no means equivalent, since the trans- 
formation (20) of the ry-plane into the p¢g-plane is not one-to-one; never- 
theless, from the behavior of the trajectories of (25) we can draw certain 
conclusions about the behavior of the trajectories of (19). The behavior 
of the trajectories of (25) will be considered only in the strip |p| < e. 
Let us first find the state of equilibrium of the system (25). I'rom the first 
equation of (25) it is evident that for sufficiently small p ~ 0 the value 
of p is not zero [see (22)], and therefore in the band |p| < € all states of 
equilibrium lie on the axis p = 0. After this we find all states of equilib- 
rium from the second equation of (25) [see (23)]: 


p= 0, ¢=> k= 0, +1, +2,.... 
If we linearize the system (25) at the point p = 0, ¢ = k7/2, we obtain 


Ap = ux Ap, 
Ag = (u — 2): (—1)* Ay + ay Ap, 
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where yu; (which is equal to \ for even values of k and to uw for odd) is a 
negative number. Thus the point p = 0, ¢ = k7/2 is a stable node of 
(25) whenever k is even and a saddle when k is odd (Fig. 70). Here the 
unstable branches of a saddle are directed along the y-axis and the stable 
branches along the curves which approach the saddle from above and from 
below (sce Theorem 22). 

We shall now show that for sufficiently small positive ¢, every solution 
of system (25) which starts in the strip |p| < ¢ either is a stable branch 
of one of the saddles of the system (25) or, by not leaving the strip 
\o| < €, approaches asymptotically one of the nodes of (25). 

To every state of equilibrium p = 0, yg = k7/2 we shall make corre- 
spond its neighborhood U;, defined by the inequalities |p| < 4, 
le — kw/2| < 68, where 6 is a positive number. If & is even, then the 
state of equilibrium under consideration is a stable node, and by virtue of 
its asymptotic stability there exists a positive number 6 so small that 
every solution which starts in the neighborhood U;, approaches the node 
asymptotically. If & is odd, then the corresponding state of equilibrium 
is a saddle, and there exists a positive number 6 so small that a solution 
different from the state of equilibrium and starting at U; either describes 
the stable branch of a saddle or leaves the neighborhood U;, (see Theorem 
22). Since the right-hand sides of (25) are periodic in ¢, it is possible to 
select a positive 6, which is common for all neighborhoods U;. Now it is 
possible to select a small positive number € < 6 such that in the strip 
|p| < e€ the right-hand side of the first of the equations (25) has a sign 
opposite to that of p, so that for every solution which starts in this strip, 
the quantity |p| decreases. Further, for fixed 6 we can choose a small 
positive number e€ such that in the rectangle 


lp|l <«, za tises 
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the right-hand side of the second of the equations (25) does not change 
sign, and its absolute value does not exceed some positive number a, 
so that the solution starting in this rectangle leaves it within a period of 
time not exceeding the number 7/2a and enters that neighborhood U;, 
or Ux41 corresponding to the stable node. By the periodicity of the 
system (25) in y, the number e can be taken to be the same for all rec- 
tangles under consideration. 

We now see that for this value of € every solution which starts in the 
strip |p| < € either passes through the stable branch of the saddle or 
approaches the stable node asymptotically. 

To every solution of system (25) which starts in |p| < e€ there corre- 
sponds a solution of system (19) which starts at a distance less than e 
from the stable node O of this system. In order to obtain all such solutions 
of (19), it is sufficient to consider only solutions of (25) starting at 
0 < p<. By the periodicity of the system (25) in y and by the peri- 
odicity of the transformation (20), there exist only two solutions of (19) 
corresponding to stable branches of saddles of the system (25) which pass 
through for p > 0, and these solutions of (19) approach the state of 
equilibrium O asymptotically, being tangent to the straight line Q and 
approaching O from opposite sides. To those solutions of (25) which tend 
to stable nodes correspond solutions of (19) which tend to the state of 
equilibrium O and which are tangent to the straight line P as they ap- 
proach 0. Thus Theorem 23 is proved. 


THEOREM 24. Let us assume that the origin O of system (1) is a focus, 
i.e., that the eigenvalues of the matrix (a}) are the complex conjugate 
numbers 

A=pe+wD, j=p— Wy, 


where up ~ 0, vy = 0. Thus if uw» < 0, then as t — ~ all trajectories 
passing through some neighborhood of the point O spiral around the 
origin; if uw > 0, then as t > —o all trajectories passing through a 
neighborhood of the point O spiral around the origin O. 


Proof. For the proof we shall use the canonical form (6) by rewriting the 
variables § and 7 as x and y. Thus it is necessary to study the system of 
equations 


£= f(x,y) = wx — vy + r(x, y), (26) 
y = g(x,y) = ve + my + 8(2, y). 


We introduce polar coordinates, i.c., we set 


x= pcosg, y = psing. (27) 
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Differentiating (27) and substituting the expressions obtained into (26), 
we have 


pcos ¢ — pesin eg = upcos »y — vpsin g + r(p cos ¢, psin¢), 
psin ¢ + pe cos y = vp cos ¢ + upsin gy + s(pcos yg, psin g). 
Solving the system of equations obtained for 6 and ¢, we obtain 


6 = up + p*p(p, ¢), 
(28) 
¢=v-+ pap, ¢), 


where p(p, v) and q(p, ¢) are functions which are bounded for small p 
and periodic in g with period 27. To be specific, we shall assume that 
u <0. We shall consider the trajectory of system (28) which starts at 
the point (po, go), where 0 < po < € and € is a sufficiently small number. 
From equations (28) it follows that this trajectory approaches asymptoti- 
cally the axis p = 0, where ¢ tends cither to +0 or to —o, depending 
on whether » is positive or negative. Hence it follows that the correspond- 
ing trajectory in the ry-plane spirals around the origin. Thus Theorem 24 
is proved. 

The following proposition (D), which is essentially a generalization of 
proposition (C) proved above, is used only in proving Theorem 23. 

(D) Let G(p, ¢) be a function defined in the domain W which is given 
by the inequalities |p| < €, 8: < ¢ < 2 and satisfies the condition 


G(0, ¢) = 9, (29) 
and which has the property that the function 
dG(p, ¢) 
op 


exists and has continuous partial derivatives up to order r inclusive. 
Then in the domain W the function G(p, ¢) can be written in the form 


G(p, ¢) = pH(p, ¢), (30) 
where the function H(p, ¢) is defined by the equations 


H(o, ¢) = Cee), for p ¥ 0, 
(31) 
_ 9G(0, ¢) 


H(0, ¢) ap 


and has continuous partial derivatives up to order r inclusive in the domain 
W. [For r = 0, proposition (D) reduces to proposition (C)]. 
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To prove (D) we consider the function 


K(p,¢) = "R28, o<s<r, (32) 


which has continuous partial derivatives in the domain W with respect to 
p up to order s + I inclusive and satisfies the condition 


K(0, ¢) = 0 (33) 
[see (29)]. We shall prove that for p ~ 0 the inequality 
ad (K(p, ¢) 4 OtTK (9:0, ¢) 


t=0 


holds, where, for every fixed k, the numbers Yo, 7¥1,..., Y; do not depend 
on the function G(p, ¢) and satisfy the condition 


1 - 
Yor Mn be + Me = FTG (35) 
and the numbers @, 61,..., 6, satisfy the inequalities 
0<6@< 1, 7=0,1,...,h. (36) 
Calculating the derivative (d"/dp*)(K(p, ¢)/p) by the Leibniz formula, 
we obtain 
a" K(p, ¢) __ 1 x a aK (p, ¢) 
apk ( p ) prt d OP api’ (37) 
where, for every fixed k, the numbers ao, a1, . . . , a, do not depend on the 
function G(p,¢). Expanding each of the functions 0'°K(p, ¢)/dp’, 
«= 0,1,..., 4%, in Taylor series in powers of p, we obtain 


aK(p,¢) _ &KO,¢) , p a 'K@,¢) 


Opi dpi 1! apiti tee: 
4 er KO ¢) | pet! at K (6:0, 0), 
(k— i)! dpe (k—i+1)! peri 


(38) 
Furthermore, on substituting (38) into (37), we obtain by virtue of (33) 


a* (K(p, ¢) 1 ,9°K(, ¢) * 2410°*'K (8p, ¢) 
ap* ( p ~ pet > bip Op? + > V5 Opk+1 ; 


(39) 
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where b; and Y; are constants which, for every fixed k, do not depend on the 
choice of the function G(p, ¢). 

To prove the relations (34) it is sufficient to establish that the constants 
by,..., b, equal zero and that the constants Yo, ¥1,..., ¥% satisfy condi- 
tion (35). Since these constants do not depend on the choice of the func- 
tion G(p, ¢), it is sufficient to establish these properties for any functions 
G(p, ¢) whatsoever. We consider the case when G(p, ¢) is the polynomial 


es k+1 a 3 
Coed = Goa Lae (40) 
By (82) we find that 
a* K(p, ¢) Ak+1 
ap ( p ~~ k+1 (41) 


On the other hand, the identity (39) for the polynomial (40) has the form 


a* /(K - 1 k ; k 


J=0 


The right-hand sides of (41) and (42) must coincide for |p| < €, p ¥ 0, 


but since the numbers ai,..., x41 are arbitrary, it follows that the 
numbers b,,...,b, are equal to zero, and (35) then follows. Thus 
formula (84) is proved. 
We now consider the function L;(p, ¢), = 0,1,..., 8s, and set 
k 
tale, o) = 2g (E@*), for p 0, 
1 at'KO 
L,(0, ¢) = qv* K(, ¢) | 


k+1 Opk+1 


From (34) and (35) it follows that L;(p, ¢) is a continuous function of both 
p and ¢ in the entire domain W. It is clear that for p ¥ 0 the inequalities 


dLi(p, ¢) | k=0,1,...,s—1, (44) 


Lyi(p, ¢) = ap 


hold. Let us prove that the equalities are also valid for p = 0. Let 
0 < po < 6,0 < p < e; then we have 


p 
Li(p, ¢) = Lx(po, ¢) +f En4i(&, ¢) dé. (45) 


Since the functions on both sides are continuous, this equality is also valid 
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for p = 0, so that we have 
0 
Lx(0, ¢) = Le(Po, ¢) +f Ln4i(&, ¢) dé. (46) 
Po 


Subtracting (46) from (45) and dividing the result by p, we obtain 


Lip, ¢) — Li(0, 9) _ fy Een 0) ae 
p p 


p> 0. 


Passing to the limit as p — 0, we see that the right-hand derivative of the 
function Lz(p, ¢) with respect to p at the point p = 0 exists and is equal 
to Ly41(0, ¢). In exactly the same way we can prove that the left-hand 
derivative is also equal to Lz41(0, ¢). Thus (44) is valid in the entire 
domain W. 

From (43) and (32) with k = 0 and s = r we obtain (30) and (31), and 
from (44) and (32) it follows that the function H(p, ¢) has the continuous 
derivative 


gt tke p, 9 

ae 0O<k<s, 0O<s<r, 
which means that the function H(p, ¢) has continuous partial derivatives 
up to order r inclusive. Thus proposition (D) is proved. 


31. Stability of periodic solutions. In this paragraph we shall study the 
problem of stability of periodic solutions of autonomous systems and of 
systems with periodic right-hand sides. 

The concept of stability. In §26 we gave the definition of Lyapunov 
stability of a state of equilibrium of an autonomous system. First of all, 
we shall give here a definition of Lyapunov stability of the solution of an 
arbitrary system of equations. 


Let 
x = f(t, x) (1) 


be the vector form of an arbitrary normal system of nth-order equations 
whose right-hand sides, together with their derivatives df‘(t, x)/dx’, are 
defined and continuous in a certain domain I of the space of the variables 
t,x. The solution of equation (1) with initial values 6, — will be denoted by 
e(t, 6, £). 

Definition. A solution g(t) of equation (1) with initial values to, Xo is 
called Lyapunov stable if the following conditions are fulfilled: (1) There 
exists a positive number p such that, for |x — Xo| < p, the solution 
g(t, to, X1) is defined for all values ¢ > to; in particular, the solution g(¢) it- 
self is also defined for all t > éo. (2) For every positive number e, a positive 
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number 6 < p can be found such that for |x; — xo| < 6 we have 
let, fo, X1) — o()| < € for ¢ > ty. The solution g(t) of equation (1) 
which is Lyapunov stable with initial values to, x9 is called asymptotically 
stable if a positive numbero < pcan be found such that for |x, — x9| <¢ 
we have 

lp(t, to,%1) — p®| 70, as too. 


The definitions presented here of Lyapunov stability and of asymptotic 
stability are invariant with respect to the choice of initial values ¢o, Xo of 
the solution g(t). This can easily be derived from Theorem 18. 

In the particular case when the system (1) is autonomous and the solu- 
tion g(é) is the state of equilibrium, the definitions of stability given here 
coincide with those given in §26. 

We shall now consider systems (1) whose right-hand sides are periodic 
functions of ¢ with period 7, 


fi¢+7,x) = f(t,x), (2) 

and also systems (1) which are autonomous 
f(é, x) = f(x). (3) 
In both cases the problem of stability of a periodic solution with period 7, 
g(t +7) = of), (4) 


will be investigated, and, in the case of an autonomous system, the periodic 
solution will be assumed to be different from the state of equilibrium. In 
the case of the periodic system [see (2)], sufficient conditions for the asymp- 
totic stability of the solution (4) with period 7 will be given. The autono- 
mous: system is a particular case of the periodic system, so that we might 
expect that these conditions apply also to a periodic solution of the auton- 
omous system. It turns out, however, that they are not fulfilled (a periodic 
solution of an autonomous system cannot be asymptotically stable), so that 
for the Lyapunov stability of a periodic solution of the autonomous 
system, we shall give other weaker conditions. 

(A) In order to study the behavior of solutions of (1) in the neighbor- 
hood of the solution g(t), we shall introduce a new unknown vector func- 
tion y by setting 

x = g(t) + y. (5) 


In what follows we shall assume that the right-hand sides of (1) have 
continuous second-order derivatives in the domain with respect to the 
coordinates of the vector x. Replacing the variables in system (1) by (5), 
using the fact that ¢(¢) is a solution of (1), and expanding the right-hand 


31) STABILITY OF PERIODIC SOLUTIONS 263 


sides in powers of y, we obtain 


y= x Feo) 0) yi + Gy). (6) 


3 
By linearizing this system, i.e., by discarding the terms r* which are at 
least of the second order in y, we obtain the linear system 


y = Ady, (7) 


where A(é) is a matrix with elements 


a’ ia) = af'(t, ep) iC (0 ). 


We shall now assume that the right-hand side of (1) is a periodic function 
of ¢ with period 7 [see (2)] and that the solution g(Z) is also periodic with 
period 7. Under these assumptions, the linear system (7) is periodic with 
period rf: 

ai(t + 7) = a;(t), j= 1,...,%, 


so we can speak about its characteristic numbers [see §19, (E)]. Thus 
whenever (1) is autonomous [see (3)] and its periodic solution g(t) is not 
a state of equilibrium, the linear system (7) necessarily has one char- 
acteristic number equal to unity. 

Let us prove this last assertion. Let Y(é) be a matrix which satisfies the 


matrix equation 
WY = A(é)v 


with the initial condition 
W(to) = E, (8) 


and let C be the fundamental matrix of the solution ¥(é) [see 19, (A)], so 
that vi+r) = VOC. (9) 
It is seen directly that any solution ¥(é) of (7) may be written in the form 
H(t) = VO) (to). 
From this and from (8) and (9) it follows that 
W(to + 7) = C(to). (10) 


Remembering that (1) is autonomous, we have [see (3)] 


o(t) = f(e(), 
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which, when differentiated with respect to é, gives 


p(t) = Al). 


Thus the vector function g(t) satisfies the vector equation (7). But o(t) 
is periodic with period 7, so that from (10) we obtain 


p(to) = o(to + T) = Colto), (11) 


and since (to) * 0 because g(t) is not a state of equilibrium, it follows 
that the matrix C has an eigenvalue equal to unity and consequently one 
of the characteristic numbers of (7) is equal to unity. 

The theorems of Lyapunov and Andronov-Witt. Now we can formulate 
sufficient conditions for the stability of the periodic solution ¢(é) whenever 
system (1) is periodic and whenever it is autonomous. 


THEOREM 25. Let equation (1) be periodic in ¢ with period 7 [see (2)]}, 
and let y(t) be a periodic solution of (1) also having period 7 [see (4)]. 
If the absolute values of all characteristic numbers of (7) [see §19, (E)] 
are less than unity, then the solution g(t) is asymptotically stable; 
moreover, there exists a number ao > 0 such that for |x, — x9| <o 
the bound 


lo(t; to, X1) — o(t)| < re“ [x1 — Xo], — t > bo, (12) 
holds, where r and a@ are two positive numbers which do not depend on x. 


THEOREM 26. Let (1) be autonomous and let g(t) be a periodic solution 
with period 7 which is distinct from the state of equilibrium. If the 
characteristic number of (7), which is equal to unity, has multiplicity 
one and if the absolute values of all remaining characteristic numbers of 
(7) are less than unity, then the solution ¢(¢) is Lyapunov stable. 


Theorem 25 was formulated by Lyapunov. Theorem 26 was derived by 
Andronov and Witt as a rather simple corollary of a rather obscure theorem 
of Lyapunov. We shall give here an alternative proof of Theorem 26, 
which will be based on Lyapunov’s method. 

We shall preface the proofs of Theorems 25 and 26 with certain con- 
structions which will prove useful. 

In §25 we gave the definition of the derivative of a certain function with 
respect to an autonomous system of equations. We shall present it here 
for the case of a nonautonomous system. 


(B) Let 
F(x) = F(z',..., 2”) 


be a scalar function of the vector variable x. We define the derivative 
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Fuy(to, Xo) of this function with respect to system (1) at the point to, Xo 
in the following manner. Let g(t) be the solution of equation (1) with 
initial values fo, Xo. If we set 


‘, d 
Fay(to, Xo) = di F(e()| 
t= 
and perform the differentiation indicated on the right-hand side, we obtain 


OF (x) 


i 
rar Ox 


Fay(t, x) = AG x). 


Whenever (1) is autonomous, the derivative F,1)(t, x) of the function F(x) 
with respect to (1) at the point ¢, x does not depend on f. 
(C) Let . 
z= Bz+ p,z) (13) 


be a normal system of differential equations in vector form, where B = (b}) 
is a constant matrix, all of whose eigenvalues have negative real parts, and 
p(t, z) is the remainder defined for t > fo, |z| < cc > 0) and possessing 


the bound 
Ip(t, z)]| < plz]? (14) 


where p= is a positive number. Thus the solution z = 0 of equation (13) 
is asymptotically stable; moreover, the estimate 


X(t, Z1)| < rizy|e—, t > to, (15) 


is valid for the solution z = X(t, Z,) with initial values to, Z1, |z1| < ¢1 < ¢, 
where r and a@ are positive numbers which do not depend on 2. 
Proposition (C) may be proved in exactly the same way as Lyapunov’s 
theorem (see §26). We shall carry out this proof without going into 
excessive detail. 
Let IV(z) be Lyapunov’s function for the linear system 


z= Bz (16) 
with constant coefficients [see §26, (I2)], so that the inequality 


aWw(z) 


tj _ 
bie) < —26W@), 8 > 0, 


W16)(z) = 
a7 
holds. Irom this inequality and from (14) we obtain for W(z) < ce the 
inequality 


Was(z) = , ae bye? + aie me) 2) S —2aW(2), 
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where a < 6 and ce are certain positive numbers. Let us set 
wt) = W(x(t,2z1)) for W(z) < ee. 
For the function w(t), £ > to, the inequality 
wt) < —2aw(t) (17) 


holds whenever the relation 
w(t) < cg 


is valid. From (17) it follows that when the inequality w(t) < cz is valid, 
the function w(t) decreases (or, more accurately, it does not increase), 
and since at the initial moment ¢t = ¢g the inequality w(t) < ce is fulfilled, 
the point X(t, z,) cannot leave the closed set F defined by the inequality 
W(z) < ce, so that the solution X(¢, z,) is defined for all values t > tg 
[see §24, (B)] and (17) holds for all these values. Assuming now that 
Zz, ~ 0, we may perform the following calculations, starting from (17): 


or, by integrating, we obtain 


In w(t) — In w(to) < —2a(t — to), 


and from this follows 
w(t) < w(to)e 240, 


or, what is the same thing, 
W (xt, Z1)) < W (ze 20? 


The estimate (15) follows directly from this estimate. Thus proposition 
(C) is proved. 
Proof of Theorem 25. By Theorem 12 there exists a transformation 


y = T(z, (18) 


where the matrix T(t) is real and has period 27, which transforms (7) into 


the equation 
z= Bz 


with a constant real matrix B. The solution of the equation z = Bz 
is the matrix e'? [see §19(C)], so that the matrix e?7* is fundamental for 
this equation as well as for equation (7). Thus by the hypotheses of 


Theorem 25 the absolute value of each eigenvalue of the matrix e?7? is 
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less than unity. But according to Theorem 27, the eigenvalues of the 
matrix e7”? have the form e?™, where \ runs through all eigenvalues of B. 
Thus |e?"| < 1, so that all eigenvalues of B have negative real parts. 
Under the transformation (18) equation (6) can be reduced to the form (13), 
and for its solution z = X(t), Z;) we obtain the estimate (15). From this 
estimate and from the fact that the matrix 7'(¢) is nondegenerate, we 
obtain the estimate (12). Thus Theorem 285 is proved. 

Proof of Theorem 26. Proceeding from the assumption that equation 
(7) has a characteristic number equal to unity and of multiplicity one and 
that the absolute values of all remaining characteristic numbers are less 
than unity, we shall show that there exists a transformation 


y= Tz (19) 


with a real matrix 7(¢) of period 27, which transforms (7) into the equa- 
tion 
z= Bz (20) 


with a constant matrix B of the form 


= (2.2) , (21) 
0 10 


where B* is a square matrix of order n — 1, all of whose eigenvalues have 
negative real parts. 
Let C be the fundamental matrix of a certain solution of the matrix 
equation [see (7)] 
Y = A(t)Y. (22) 


Since C' has an eigenvalue equal to unity and of multiplicity one, then in 


some basis it has the form 
* | 
(=) , (23) 
0 {1 


where C* is a real square matrix of order n — 1, all of whose eigenvalues 
have an absolute value less than unity [see §32, (G)]. Since the matrix 
C and the matrix (23) may be obtained from each other by a linear trans- 
formation, the matrix (23) is fundamental for some solution of (22); we 
shall assume that C coincides with the matrix (23). By proposition (D) 
of §33 there exists a real matrix B* which satisfies the condition 


2TB* __ cy%2 
e = C* 


where, by Theorem 27, all eigenvalues of the matrix B* have negative 
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real parts. It is clear that the matrix B [see (21)] satisfies the condition 
e2tB _ 2 
{see (23)]. Thus (compare the proof of Theorem 12) there exists a trans- 
formation (19) which transforms (7) into (20). 
We shall now determine those conditions which the matrix T(t) must 


satisfy in order that the transformation (19) transforms (7) into (20). 
Differentiating (19), we obtain 


y = Ti)z4+ Tt = Tt)z + T(t)Bz. 
Replacing z in this equation by the formula z = T~'(t)y, we obtain 
y= (Tt) + T)B)T Wy. 
Since this equation coincides with (7), we have 
(T(t) + TWB)T-*(t) = AO), 
and, by multiplying this relation on the right by the matrix T(¢), we obtain 
T(t) + THB = A(t)T(t). (24) 


This condition imposed upon the matrix T(t) is necessary and sufficient 
for transformation (19) to transform (7) into (20). We shall decompose 
(24) into two relations by representing the matrix 7'(¢) in the form 


T(t) = (T*(t), t®), 


where 7*(t) has n rows and n — 1 columns and where the matrix t(t) 
is the last row of the matrix T(t) and therefore is a nonzero vector. We 
obtain [see (21)] 

T*(t) + T*()B* = A(T*(O), (25) 


t(t) = A(Z)t(d). (26) 


From the last relation it is clear that t(£) is a periodic solution with period 
27 of equation (7) so that the condition 


t(to) = t(to + 27) = C?t(to) 


is satisfied [compare (10)]. Thus the vector t(t9) is an eigenvector of the 
matrix C? with an eigenvalue equal to unity. Since the matrix 


a 
C= (e°) 
0 !1 
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has unity as a simple eigenvalue and since we already know one vector 
y(to) * 0 with this eigenvalue in the matrix C? [see (11)], we have 


t(to) = Ye(to), 
and therefore 
tt) = veld), 


since both vector functions t(t), ¢(¢) are solutions of (7). From this it is 
clear that if we replace the last column t(é) of T(é) by the vector ¢(é), 
then the matrix (7*(é), $(t)) obtained again will satisfy conditions (25) 
and (26). Therefore we shall assume that 


T(t) = (T*(), oO). (27) 


Starting from (25) and (27), we shall transform the unknown function x 
of equation (1) in the autonomous case [see (3)] into the new unknown 
functions z* and s, where z* = (z},..., 2" 1) is an (n — 1)-dimensional 
vector (which will be regarded in the sequel as a matrix with one column) 
and where s is a new scalar variable. For this we set 


x = T*(s)z* + o(s) = g(z*, 8). (28) 


This transformation is periodic in s with period 27. To each pair z*, s 
relation (28) places in correspondence for sufficiently small |z*| the point 
x, which is close to the point g(s) of the periodic trajectory K defined 
by the solution x = g(t). In the neighborhood of each pair z* = 0, 
8 = Sp relation (28) is a one-to-one transformation, since the functional 
determinant of this transformation at the point z* = 0, s = So is equal 
to the determinant of the matrix T(so) [see (27)] and therefore is distinct 
from zero. The coordinate s of the pair (z*, s) will be assumed to be a cyclic 
coordinate with period 27, i.e., we shall identify the pair (z*,s) and 
(z*,s + 2r). Since the pairs (0, sg) and (0, s, + 7) are mapped by (28) 
into the same point g(sq) of the trajectory K, some neighborhoods of the 
pairs (0, so) and (0, s) + 7) are mapped one-to-one into the same neigh- 
borhood of the point g(s 9) of K. Thus, transformation (28) maps the set 
of all pairs (z*, s) (for sufficiently small |z*|) in a two-to-one manner onto 
some neighborhood of K. Here the closed curve consisting of all pairs 
(0, s), 0 < s < 27 is mapped onto the line K twice. 

We shall now substitute into equation (1) [see (3)] the unknown vector x 
according to formula (28). Substitution into the left-hand sides gives 


x= T*(s)z*s + T*(s)Z* + o'(s)8. (29) 
Substitution into the right-hand side gives 
f(x) = f(y(s)) + A(s)T*(s)z* + R(s, 2*), (30) 
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where the remainder R(s, z*) is periodic in s with period 27 and is an 
infinitesimal of the second order with respect to the vector z*. Equating 
the right-hand sides of (29) and (30), we obtain 


T*(s)t* + o'(s)8 + T*(s)2*8 = £(y(s)) + A(s)T*(s)z* + R(s, 2*). 


Replacing the matrix A(s)T*(s) according to formula (25) and substituting 
¢’(s) for f[e(s)], we obtain 


T*(s)z* + o'(s)8+ T*(s)z*8 = o'(s) + (T*(s) + T*(s)B*)z* + R(s, 2*), 
whence 
T*(s)(2* — B*z*) + (g!(s) + T*(s)2*)(8 — 1) = R(s,2*). (31) 
We now consider two new auxiliary variables, the vector 
u*(ul, 22, uu?) 


and the scalar u”. In the n-dimensional space of the variables (u*, u”) = 
(ul, u?,..., u™) we consider the linear transformation M which depends 
on the parameters s and z*, and set 


M(u*, u”) = T*(s\u* + (o's) + T*(s)z*)u". 


For z* = 0 the transformation M reduces to T(s), so that close to zero M 
is nondegenerate. Thus the equation 


M(u*, u") = R(s, 2*) 


can be solved uniquely (for z* close to zero) for the unknowns u*, u”, and 
the solution 
u* = q*(s, 2*), 


u” = q(s, Z*) 


is periodic in s with period 27 and is an infinitesimal of the second order 
with respect to z*. Since (31) can be rewritten in the form 


M(z* — B*z*,s — 1) = R(s, z*), 


we obtain 
z* — B*z* = q*(s, z*), §—1= qs, z*). 


Thus in terms of the variables z*, s, equation (1) may be written in the form 
z* = B*z* + q*(s, 2*), (32) 
$= 14+ qs, 2*). (33) 
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There now exists a positive number e€ such that for |z*| < € the re- 
mainder 9(s, z*) satisfies the inequality |g(s, z*)| < 1. When this inequal- 
ity is satisfied for every solution z* = z*(t), s = s(t), then s may be 
taken in place of t as the independent variable, and equations (32) and (33) 
may be rewritten in the form 


dz* B*z* + q*(s, z*) 


ds” —- 1+ q(s, z*) 

at 1 

ds 1+ 4q(s,z*)’ 

or 

* 
= = B*r* + &*(s, 2*), (34) 

Ss 

dt . 

ais 1+ k(s, z*), (35) 


where the remainders k*(s, z*) and k(s, z*) are periodic in s with period 
27 and are infinitesimals of the second order with respect to the vector z*. 

In the system (34), (35), s is the independent variable, but z* and ¢ 
are considered to be unknown functions of s. Equation (34) does not con- 
tain the unknown function ¢ and can be solved separately. Thus, in order 
to find the solution of the system (32) and (33) with initial values fo, 
zi, 81, it is first necessary to find the solution z*(s, z{, 81) of (34) with 
the initial values zt, s; which by proposition (C) is defined for all values 
s > s, for sufficiently small |z}| and has the estimate 


|z*(s, 27, 81)| < rlzzle"*. (36) 


After this we must find the solution of (35) with the initial values to, zf, 81; 
this solution is given by the obvious formula 


i= i) +f [1 + k(s, z*(s, zi; 81))] ds 


= to — $1 + s +f k(s, z*(s, Zi; $1)) ds. (37) 


The last equation can be solved for s whenever |z*| is sufficiently small, 
so that we obtain 
8 = s(t, Z1, 81). (38) 


Substituting this expression for s in the solution z*(s, z{, 81) of equation 
(34), we have 
2¥(t) = 2*(s(t, 21, 81), 21, 81)- (39) 
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Formulas (38) and (39) together give the solution of the system (32), 
(38) with the initial values tg, z7, $1. From (37) it follows that for t > to 
we have 

Ist, 21 81) — t| S [sx — tol + Uzi, (40) 


where | is some positive constant. In the particular case when zt = 0, 
$8; = fg, the solution (38), (39) has the form 


z*i) = 0, s(t) = 1. 


From the estimates (36) and (40) it follows that this solution of the sys- 
tem (32), (33) is Lyapunov stable. 

If we substitute the solution (388), (39) into the transformation formula 
(28), we obtain the solution x = g(t, x,) of equation (1) with initial 
values ¢ = to, ¥X = x; = g(zi,s,). Since (28) is a one-to-one mapping 
onto some neighborhood of the pair z* = 0, s = to, any solution ¢(E, x1) 
of equation (1) with initial values to, x, can be obtained in this way from 
a certain solution (38), (39) of the system (32), (33) for sufficiently small 
x; — Xo|. Here the solution x = g(t) is obtained from the solution 
z* = 0, s=t. Now the Lyapunov stability of the original periodic 
solution x = g(t) follows from the Lyapunov stability of the solution 
z* = 0, s =2 and from the uniform continuity of (28). Thus Theorem 
26 is proved. 

Let us apply the results obtained here to the case of a limit cycle. 

(D) We shall assume that the autonomous system (1) [see (3)] is of the 
second order: 

+ = fi(z', 2”) = f'(@), t= 1, 2, 
and let 
x= o(?) 


be its periodic solution with period 7. Here system (7) has the form 


p= FCO) 4 FCO) 2 pa, 


ox! Ox? 


By proposition (A) one characteristic number of this system is equal to 
unity; the second will be denoted by X. Then 


= exp | , (eo) n aPlelo)) a} an 
; 


0x1 0x2 


Thus if 


[ , (Heo) +O) 4 <0, 


dz 2 
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then the periodic solution x = g(t) is Lyapunov stable. In fact (see ex- 
ample) there exists a succession function X(u) of the periodic solution 
x = g(t) (see §28) for which 


x’(uo) = d, (42) 


so that for \ ¥ 1 the periodic solution x = g(t) is a rough limit cycle. It 
is stable for \ < 1 and unstable for A > 1. 

Let us prove the inequality (41). The fundamental matrix C of the 
solution Y = W(t) of the equation Y = A(t)Y with initial value ¥(0) = E 
[see (A)] is given by the equality 


C = V(r). 


By Liouville’s formula we have 


Det ¥(r) = Det (0) - exp (f, S() a) ’ 


where 


af?(y(t)) 


Ox? 


af (o()) 


0x1 


S(t) = ai(t) + a3(t) = + 


[see §17, (G)]. In our case, when the matrix C is of second order, and one 
of its eigenvalues is equal to unity and the other is equal to A, we have 


>» = DetC = exp i (Ceo) + ret) a 


0x1 


EXAMPLE 


Let o(t) be the periodic solution of the autonomous system (1) [see (3)] 
with period 7 with the initial values ¢p, Xo. The solution of this system with 
initial values to, & will be denoted by g(t, —&). We shall construct for the 
solution g(t) an analogue of the succession function (see §28), which here 
will be a mapping of the (n — 1)-dimensional space of variables into itself. 

Let 

x = g(u); u= (u',...,u"!) (43) 


be the equation of a surface which intersects the trajectory g(t) at the 
unique point 
Xo = o(to, Xo) = B(Uo) (44) 


and which is not tangent to the trajectory g(é) at this point, so that the 
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vectors 
dg (Uo) yee , 28(Uo) (45) 


P(t), aul aun—1 


are linearly independent. Let us find the intersection of the trajectory 
g(t, g(u)] with the surface (43) at ¢ close to tg + 7, assuming that ju — uo| 
is small. Let g(v) be the point of intersection; then the relation 


e(t,g(u)) — g(v) = 0 (46) 
holds. For u = uo, we have the trivial solution of equation (16): 
t= to + T, v= Uo 


{see (4) and (44)]. Here we consider u as an independent variable and 1, 
vas unknowns. Since the functional determinant of (46) at ¢ = to +7, 
U = Up, V = Ug with respect to the unknown functions ¢ and v does not 
vanish because of the linear independence of the vectors (45), then for 
small |u — uo|, there exists a solution 


t= t(u), v= X(u) 


of (46) with small |t(u) — (to + 7)| and |x(u) — uo]. The mapping 
Xx(u) of the space of variables u',..., u”—! into itself (which is defined for 
small |u — uo) will be called the succession mapping. To each solution 
u = u, of the equation 

x(u) —u=—0 (47) 


corresponds the periodic solution g[t, g(u,)] of the autonomous equation 
(1) [see (3)] with a period close to 7; in particular, to the solution u = uo 
corresponds ‘the original periodic solution g(t) = g(t, g(uo)). If the func- 


tional matrix 
3x*(uo) _ 
M = “our J’ 4j=1,...,.—-1, 


does not have eigenvalues equal to unity, then the solution u = uo of (47) 
is isolated. Indeed, the functional matrix of (47) at u = uo is equal to 


M — E*. 


For the determinant of this matrix not to vanish, it is necessary and sufh- 
cient that the matrix M not have an eigenvalue equal to unity. 

We shall now answer the question of whether every periodic trajectory 
K passing close to the trajectory K described by the solution y(t) can be 
described by the solution ¢g[é, g(ui)], where u,; is some solution of equa- 
tion (47). This was just the situation in the plane case (n = 2). It turns 
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out that for n > 3 the situation is different. Let us examine this question. 
We shall assume that 7 is a minimal period of the solution g(t), i.e., that 
the equality 

e(to + t) = elto) 


can be valid only when t = kr, where k is an integer {see §15 (C)]. If the 
trajectory K, is close to the trajectory K, then it intersects the surface (43) 
at some point g(u,), where ju, — up| is close to zero. We shall set 


Ung = X(u;), Ug = X(U2), ..., Uy = X(U), 


Since the trajectory K, is closed, we can find a point in this sequence co- 
inciding with the point u,; let u,4, be the first such point. Then the tra- 
jectory K, is described by the solution ¢[t, g(u;)], where its minimal period 
is close to the number kr; the solution ¢[t, g(u,)] closes only after it goes 
around the trajectory g(t) k times. In the plane case only the case k = 1 
is possible. We shall call the number & the multiplicity of the trajectory 
K,. For determining double trajectories it is necessary to solve, not equa- 
tion (47), but the equation 


x(x(u)) —u= 0; 
for determining triple trajectories it is necessary to solve the equation 
x[x(x(u))] — u = 0, 


and so on. The functions x(x(u)), x[x(x(u))],... are called iterations 
of the function x(u); the kth iteration will be denoted by x*(u). Thus to 
find all k-tuple periodic solutions close to the solution g(é), the equation 


x*(u) -u=0 (48) 


must be solved, but from al! solutions of equation (48), only those are to 
be taken which are not solutions of equations of multiplicities encountered 
previously; the solution u = wg of equation (47) is also the solution of any 
equation (48). The functional matrix of equation (48) at u = wg is ob- 
viously equal to M* — E*; thus in order that equation (48) have only one 
solution u = wo close to uo, it is sufficient that the determinant of the 
matrix M* — E* not vanish or, what is the same thing, that the matrix 
M* not have eigenvalues equal to unity, or finally, that the matrix M not 
have eigenvalues equal to WT. Thus, in order that there not be periodic 
trajectories of a given multiplicity k in the neighborhood of the trajectory 
K, it is sufficient that the matrix M not have eigenvalues equal to WI. In 
particular, the matrix M does not have such eigenvalues if the absolute 
values of all its eigenvalues are less than unity. 
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From what has been said, it is evident how important a role the matrix 
M plays in the study of trajectories of the autonomous equation (1) 
[see (3)] which are close to the periodic solution g(t). We shall now show 
that if equation (7) has a characteristic number equal to unity of multi- 
plicity one, then for some choice of the surface (43) the matrix M coincides 
with the matrix C* [see (23)]. Let us set 


_ de'(t, €) 


WO = Se | YO = Wil). 
By Theorem 15 we have 
V(t) = A()¥(d), (49) 
where the initial condition 
Vio) = EF 


is fulfilled. Thus the matrix Y(t) represents the solution of the matrix 
equation (49), which is the matrix form of equation (7), and therefore 


Vito + T) = C. 


Since the matrix C has an eigenvalue equal to unity of multiplicity one, 
we can choose a basis in the space of vectors y [see (A)] such that the 
matrix C can be written in the form (23). Let us now take for the co- 
ordinates in the phase space of equation (1) [see (3)] the components of the 
vector y, by setting 

X= o(to) +y 


[compare (5)]. The coordinates thus obtained in the phase space will 
again be denoted by x!,... , x”, and the surface (43) will be defined by the 
equations 


If we differentiate (46) with respect to u!,..., u”~' atu = 0,t = to 417, 
v = 0 under the assumption that £ = t(u) and v = xX(u) are functions of 
the variables u1,..., u”—!, we obtain the equality 


c* = M. (50) 


In the case n = 2, the matrix C* is a scalar A, and the relation (50) gives 
(42). 

If the absolute values of all eigenvalues of the matrix C* are less than 
unity, then there are no periodic solutions of any multiplicity in the 
neighborhood of the trajectory K. This follows from the estimate (36). 


CHAPTER 6 
LINEAR ALGEBRA 


This chapter is intended to supplement the basic material of the book. 
We present here those results in the area of linear algebra which have been 
used in some of the more advanced sections. It should be noted that §34 
is based only on the results of §32 and does not use the results of §33 at all. 


32. The minimal annihilating polynomial. Eigenvalues and eigenvectors. 
(A) To each nth-order square matrix 


A = (a}), 7,j=1,2,...,2, 


whose elements are real or complex numbers there corresponds a linear 
transformation A of the n-dimensional vector space #; namely, to the 
vector 


x = (z!,..., 27) 
of the space R corresponds the vector 
Ax = y= (y',..-,y%); 


which is defined by the relation 
y= >» aig’. 
j 


To the zero matrix 0 (all of whose elements are zero) corresponds the zero 
transformation 0 which transforms every vector into zero. To the unit 
matrix 


E= (8), = 


corresponds the unit or identity transformation E of R: 


Ex = x. 
If in the space R we introduce the new coordinates x’!,..., z’” which 
are related to the old coordinates z1,...,2x” by the transformation 


formulas 


or, in matrix form, 
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then the matrix 1 

A’ = SAS— (1) 
corresponds to the transformation A in the new coordinate system. Let 
us prove relation (1). We have 


y’ = Sy = SAx = SAS™'x’. 


(B) Let A be the linear transformation and A the matrix corresponding 
to the transformation A in some coordinate system. A nonzero vector h 
is called an eigenvector of the transformation A, and the number ) is called 
an eigenvalue of this transformation corresponding to the vector h, if the 


relation 
Ab = Ah (2) 
holds. The determinant of the matrix (a} — 28}), 
De) = a; — 283] = |A — Bl, 


is called the characteristic polynomial of the matrix A. The coefficients of 
the polynomial D(z) do not depend on the choice of the system of co- 
ordinates but are completely determined by the transformation A. There- 
fore the polynomial D(z) is called the characteristic polynomial of the 
transformation A. Furthermore, the number X is an eigenvalue of the 
transformation A if and only if it is a root of the polynomial D(z). 

We shall prove that the polynomial D(z) is independent of the choice 
of the coordinate system. In the new system of coordinates, the matrix 
SAS! corresponds to the transformation A, where S is a certain non- 
singular matrix [see (1)]. We have 


|SAS~! — zE| = |SAS~* — zSES~'| = |S(A — zE)S™| 
= |8|-|A — 2B] -|S~*| = |S|-|A — zB] -|S|~" = |A — 2B. 
Let us now write the relation (A — AE)h = 0, which is equivalent to 


relation (2), in terms of the coordinates 


dS @i—rAsphei=0, i=1,...,0. 


j=l 


This system of homogeneous equations has a nontrivial solution h!,..., h” 
if and only if the determinant D(A) of this system is equal to zero. Thus 
every root \ of D(z) is an eigenvalue of the transformation A, and con- 


versely. 
(C) If the eigenvalues 1, ..., Ax of the transformation A are pairwise 
distinct, then the corresponding eigenvectors h,,..., h; are linearly in- 


dependent. 
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The proof is by induction on the number k. For k = 1 this assertion is 
obvious. Let us assume that it is true for k — 1 vectors, and prove it for 
k vectors. Let us assume that a,h; + --+-+ axh, = 0. Applying the 
transformation A to this relation, we obtain 


ayA;h; +--+ + apduh, = 0; 
on the other hand, 

hz (a,b, + +--+ azh,) = 0. 
If we take the difference of these relations, we obtain 


ay(A; — Ax)hy + +++ + @x—i(Az—1 — Ax)hy_-1 = 0. 


Hence, by the induction hypothesis, it follows that a, = --- = a,_, = 0. 

Thus if all roots of the characteristic polynomial D(z) are distinct, we 
can take the eigenvectors h,,...,h, of A asa basis of the space R. In 
this basis a diagonal matrix corresponds to A. In the general case, the re- 
duction of the transformation matrix to diagonal form is impossible, and 
we are forced to construct a comparatively complicated theory, which 
we now proceed to do. 

The minimal annihilating polynomial. (D) By well-known rules we can 
add and multiply square matrices of order n, as well as multiply them by 
given numbers; to these operations on matrices correspond the same opera- 
tions on transformations. Thus if 


f(z) = age” + az” +--+ + On 


is a polynomial with real or complex coefficients in z, then by substituting 
the matrix A for the variable z in this polynomial, we obtain the matrix 


f(A) = apA™ + a, A") + +--+ ank, 


which is a polynomial in A. The polynomial f(A) of the transformation A 
is defined similarly. If f(z) #0 and if f(A) is the zero matrix (in which 
case it is also, obviously, a zero transformation), then the polynomial f(z) 
is said to annthilate the matrix A and the transformation A. Thus the 
characteristic polynomial D(z) of the matrix A annihilates A: 


D(A) = 0. 


For the proof let us consider the n-dimensional vector space R with the 
basis 
@1, Co,..., Cn, 
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and let us study the coordinate system which corresponds to this basis, so 
that 
Ge; = (0,...,1,..., 0), 


where the coordinate | is in the j-th place. We shall denote by A the trans- 
formation corresponding to the matrix A in the basis chosen. Then we 


have 
Ae; = > aj. 
s 


or, what is the same thing, 


>> (aE — 6jA)e, = 0. (3) 


Let us set 
Lj(z) = a} — 852. 


Here Li(z) is a polynomial in z of degree zero or one, and 
(Li(2)) 


is the matrix formed from the polynomials. The cofactor of the element 
Li(z) in this matrix will be denoted by Mi(z), so that the relation 


DMi2) Liz) = D(z) (4) 


holds. Multiplying (3) on the left by the polynomial (A) and summing 
over j, we obtain by (4): 


>) Mi(A)(@JE — sfAje, = D> Mi(A)Li(A)e, 


8,j 


= }° sD(A)e, = D(A)e; = 0. 


Thus D(A) transforms all basis vectors of the space R into zero, so that it 
is the zero transformation, which means that the corresponding matrix 


D(A) is also zero: 
D(A) = 0. 


(E) In the set of all polynomials annihilating the matrix A (or the 
transformation A), there exists a unique polynomial A(z) of minimal 
degree, which is unique up to a numerical factor; this polynomial A(z) 
is a divisor of all other polynomials which annihilate the matrix A and 
is called the minimal polynomial which annihilates the matrix A. We 
shall adopt the convention that the coefficient of the term of highest degree 
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of the polynomial A(z) is equal to unity. Whenever the matrix A is real, 
the polynomial A(z) is real. 

To prove (EK), we recall that if f(z) and g(z) are two arbitrary poly- 
nomials and d(z) is their greatest common divisor, then the identity 


d(z) = -p(z)f(z) + alz)g@) (5) 


holds, where p(z) and q(z) are suitably chosen polynomials. The existence 
of the identity (5) can be proved by the division algorithm for polynomials. 
From (5) it follows that if the polynomials f(z) and g(z) annihilate the 
matrix A, then their greatest common divisor d(z) also annihilates A. 
From (D) it follows that there exist polynomials which annihilate the 
matrix A. Now let A(z) be a polynomial of minimal degree which an- 
nihilates A, and let f(z) be an arbitrary polynomial also annihilating A. 
If the polynomial f(z) is not divisible by A(z), then the greatest. common 
divisor of these polynomials would have a degree smaller than that of A(z) 
and also would annihilate A, but this is impossible by hypothesis. Now if 
A is real, then 
0 = A(A) = A(A) = ACA). 


Thus, the polynomial A(z) annihilates A and therefore is divisible by A(z), 
but this is possible only if A(z) = A(z). Thus, proposition (E) is proved. 

(F) Let A(z) be the minimal annihilating polynomial of the matrix A. 
The number ) is an eigenvalue of the matrix A if and only if it is a root of 
the polynomial A(z). 

To prove this we denote by A the transformation of the n-dimensional 
vector space which corresponds to the matrix A. We remark that if f(z) 
is an arbitrary polynomial, then 


Ah = Ab implies f(A)h = f(\)h. (6) 
Indeed, we have 
Eh = h, Ah=h, A*h = A\h= 2h, ..., A™h = Xh. 


Multiplying these relations by the coefficients of the polynomial f(z) and 
summing, we obtain (6). 

Let us assume that A is an eigenvalue of the matrix A; then there 
exists a vector h # 0, such that Ah = dh, and (6) implies A(A)h = A(\)h; 
but A(A) = 0, so that A(A) = 0. Conversely, if \ is a root of the poly- 
nomial A(z), then A(z) = (z — A)T(z). Since A(z) is the minimal an- 
nihilating polynomial of the matrix A, the polynomial I'(z) does not an- 
nihilate it, so that the matrix [(A) and consequently the transformation 
T'(A) are nonzero. Thus there exists a vector x, for which T'(A)x = h’ ~ 0, 
and we have 0 = A(A)x = (A — AE)I(A)x = (A — AE)b’, so that 
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Ah’ = hb’, whence ) is an eigenvalue of the matrix A. Hence, proposition 
(F) is proved. 

If necessary, the vector space & can be taken to be real whenever it has 
only vectors with real coordinates, or as complex whenever it has vectors 
with complex coordinates. If x = (z!,..., 2”) is a vector of a complex 
space R, then ¥ = (Z!,..., 2”) is the vector which is the complex con- 
jugate of x. If S is a subspace of a complex space R, then the subspace 8, 
consisting of all vectors which are complex conjugate to the vectors of S, 
is defined as the complex conjugate of the subspace S. 

The complex (or real) space FR is said to be decomposed into a direct 
sum of its subspaces S, and Sp» if every vector x from R can be written 
uniquely in the form of a sum: 


K=x,+%, 


where the vector x; belongs to the subspace S;, 7 = 1, 2. 

(G) Let A(z) = A,(z) Ae(z) be a factorization of the minimal poly- 
nomial annihilating the matrix A into two relatively prime factors. We 
denote by S;, 7 = 1, 2, the linear subspace of the space R consisting of 
all vectors x of R which satisfy the condition A;(A)x = 0, where A is the 
transformation with matrix A. It turns out that the space R can be de- 
composed into a direct sum of the subspaces S; and S». (If the matrix A 
is complex, then in the statement formulated here the space R must be 
considered complex.) Let us now assume that the matrix A is real; then 
it is necessary to distinguish two important cases. (1) The factors A,(z) 
and A,(z) are real; then the space RF and its subspaces S; and S»_ can be 
considered real. (2) The factors A,(z) and Aj(z) are complex conjugate; 
then the space R should be considered complex and its subspaces S, and 
S_ turn out to be complex conjugate. 

Let us prove proposition (G). Because the factors A,(z) and Ag(z) are 
relatively prime, the identity 


1 = pi(z) Ay(z) + pe(z) Ae{z) (7) 


is valid, where p;(z) and pe(z) are suitably chosen polynomials [see (5)}. 
We note that if the factors A,(z) and A(z) are real, then the polynomials 
p\(z) and po(z) can be chosen as real, since they are obtained by applying 
the division algorithm to the polynomials A(z) and A2{z). Now let x 
be an arbitrary vector of R; by (7) we have 


X = pi (A) Ai(A)x + po(A) A2(A)x. 
Setting 
X, = po(A) A2(A)x, =-X2 = pi(A) Ai (A)x, 
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we obtain the decomposition x = x, ++ X2, where 


0, 
0, 


Ai(A)x, = 41(A)p2(A) 42(A)x = p2(A) A(A)x 
A2(A)X2 = A2(A)p;(A) A1(A)x = pi(A) A(A)x 


so that the vector x; belongs to the subspace S;. Now if x = xj + x3 is 
any decomposition of the vector x into a sum in which x/ belongs to Sj, 
zt = 1,2, then by (7) we have 


X{ = pi(A) Ai(A)x{ + po(A) A2(A)x, = po(A) A2(A)(xi + x2) = x1; 


in exactly the same way X, = Xz, and the uniqueness of the decomposition 
is proved. 

If the matrix A is real and the factors A,(z) and Ao(z) are real, then, 
starting from the real vector x, we obtain real vectors x; and X2. However, 
if the matrix A is real and the factors A;(z) and A2(z) are complex con- 
jugate, then the subspaces S; and S_ are complex conjugate by definition. 
Thus proposition (G) is proved. 

(H) Let A be a linear transformation of the n-dimensional space R, let 


AAz,-- +, Ae 
be the set of all eigenvalues of this transformation, let 
A(z) = (2 — ¥1)*1(z — dg)*# ++ + (2 — Ae) 
be the minimal annihilating polynomial of the transformation A, and let 
D(z) = (—1)"(z — Ai) {(z — Az) 9 ++ + (2 — De) 8 (8) 


be the characteristic polynomial of the transformation A. Since the 
polynomial D(z) is divisible by the polynomial A(z), we have 


qa = ki, t=1,...,7r. 


? 


By (G) the space R can be decomposed into a direct sum of the subspaces 
S81, Se,..., S;, where S; consists of all vectors x which satisfy the condi- 


tion 
(A — dE)*x = 0. 


Thus the space S; is of dimension g;, and the number q; is called the 
multiplicity of the eigenvalue ),. 

We shall prove that the dimension of the space S; is equal to q;. The 
space S; is invariant with respect to the transformation A, that is, AS; is 
contained in S;. Thus, if in the space S; a certain basis is chosen, then 
to the transformation A on S; there corresponds a certain matrix A; 
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of order p;, where p; is the dimension of the space S;. If the basis of R 
consists of the bases of all the spaces S;, then in the basis obtained, there 
will correspond to the transformation A a matrix A consisting of matrices 
A,,..., A, located along the diagonal of the matrix A. Jrrom this it is 
clear that the characteristic polynomial of the transformation A of R 
will be equal to the product 


D,(z2)D2(z) . .. D,(2), 


where D,(z) is the characteristic polynomial of the transformation A con- 
sidered on the subspace S;. Since A,(z) = (z — \,)*i is the annihilating 
polynomial of A on the subspace S;, the transformation A on S; has only 
one eigenvalue i;, so that D;(z) has the form (—1)?#(z — \,)?#, because 
its degree is equal to the order of the matrix A;, that is, to the dimension 
p; of the space S;. Consequently D(z) = (—1)"(z — A 1)"1(2 — dAg)?2... 
(2 — d,)?", and therefore p; = gq; [see (8)]. Thus proposition (H) is 
proved. 


33. Matrix functions. In this section we shall make no distinction be- 
tween the transformation A and the corresponding matrix A, since we 
shall not change the coordinate system. In addition, in this section we 
shall also use some elementary facts from the theory of functions of a 
complex variable (see, for example, Ahlfors, Complex analysis, McGraw- 
Hill, New York, 1953). 

Matrix power series. (A) Let 


A(z) = (2 — Au)*t(z — Ag)Pa + + (2 — Ay)”, 


(1) 
ki >0, t=1,...,7, ki thot-+++th =k, 
be a minimal annihilating polynomial of the matrix A, where 
Ay, Ag, Ae (2) 


are its pairwise distinct roots. By proposition (I*) of §32 the numbers (2) 
constitute the set of all eigenvalues of A. We say that the function W is 
defined on the spectrum of the matrix A if there is a correspondence between 
the eigenvalue ); of the matrix A and the sequence of numbers 


Wr), WPAD,..., WPA), t= 1,...,7 (3) 


If W(z) is a function of the complex variable z which is holomorphic at the 
points ),;, then, if the numbers (3) are regarded as the value of the function 
itself and of its derivatives up to order k; — 1 at the point \;, we obtain a 
function defined on the spectrum of the matrix A. If for two functions of 
z, the values (3) coincide, respectively, then we shall say that the two 
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functions coincide on the spectrum of the matrix A. Thus two polynomials 
f(z) and g(z) coincide on the spectrum of A if and only if f(A) = g(A). 
Furthermore, it turns out that for any arbitrarily given numbers (3), 
there always exists a unique polynomial ¢(z) of degree not larger than 
k — 1, whose values on the spectrum of A coincide with the numbers (3), 
ie., 


e?(%) = W?), g=0,...,4—-—1, a=1,...,r. (4) 


Here the coefficients of ¢(z) are linear functions of the quantities (3) and 
therefore depend continuously on them. 

We shall prove these statements. If we set h(z) = f(z) — g(z), andif 
f(A) = g(A), then h(A) = 0. Furthermore, if the values of the poly- 
nomials f(z) and g(z) coincide on the spectrum of A, then the function 
h(z) vanishes on the spectrum of A. Thus, in order to prove that part of 
the assertion of (A) concerning f(z) and g(z), it is sufficient to prove that 
the polynomial A(z) annihilates the matrix A if and only if it vanishes on 
the spectrum of this matrix. To prove this, let us assume that h(z) an- 
nihilates A; then by (E) of §32 it is divisible by the polynomial A(z), and 
therefore \; is a root of multiplicity not less than k; [see (1)], and from 
this it follows that it vanishes on the spectrum of A. If the polynomial 
h(z) vanishes on the spectrum of A, then the number ); is a root of multi- 
plicity not less than k,, and therefore h(z) is divisible by the polynomial 
A(z) [see (1)]. Hence it follows that h(A) = 0. 

We shall now prove that part of proposition (A) concerning the function 
¢(z). The set of relations (4) can be regarded as a system of linear equa- 
tions in the coefficients of the polynomial y(z). This system has k equa- 
tions with k unknowns. To prove statement (A) it is sufficient to show that 
the determinant of this system is distinct from zero, and for this in turn 
it is sufficient to prove that whenever the right-hand sides of these equa- 
tions vanish, there exists only the zero polynomial ¢(z) satisfying condi- 
tions (4). Whenever the right-hand sides of (4) vanish, the polynomial 
g(z) vanishes on the spectrum of the matrix A so that it is divisible by 
A(z), and, since it is of degree not greater than k — 1, it is identically zero. 
Thus proposition (A) is proved. 

(B) Let A be a real matrix; then its minimal annihilating polynomial 
A(z) is real [see §32, (E)], so that to each root ; [see (1)] corresponds a 
complex conjugate root 4; of the same multiplicity. Thus, if the numbers 
(3) satisfy the conditions 


WX) = WA), g=0,...,k—1, i= 1,...,7, (5) 


the polynomial g(z) defined by (4) is real, so that the matrix ¢(A) is also 
real. 
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To prove proposition (B), we shall denote the coefficients of the poly- 


nomial ¢(z) by ¢!,...,¢*. Without going into details, the system (4) 
in the unknowns ¢!,..., ¢* may now be written in the form 
k 
doe = da = 1,...,k. (6) 
A=1 


By (5) this system has the property that, for each of its equations, there is 
also a complex conjugate equation, i.e., the equation 


k _ 
> ae? = a. (7) 


The set of relations (7) represents a system of linear equations in the 
unknowns @!,..., ¢*. However, in view of the property which has been 
formulated for (6), it must coincide with (7), the only possible difference 
being the numbering of the equations. Since system (6) has a nonzero 


determinant, its two solutions ¢!,..., ¢* and g!,..., @* coincide, that is, 
¢g’ = @, a= 1,...,k, so that the numbers ¢!,..., ¢* are real. Thus 
proposition (B) is proved. 
Let 
F(Z) = ag + ayz + az? + +++ + an2™ + ++: (8) 


be an analytic function of the complex variable z defined by the series (8) 
with radius of convergence p, so that for |z| < p the series (8) converges, 
and for |z| > p it diverges. 

For the sequel we recall that the series 


f'(z) = ay + Qagz + +++ manz™ 1 +--+, 


obtained from (8) by means of formal differentiation, has the same radius 
of convergence as (8) and converges inside the circle of convergence to the 
derivative f’(z) of f(z). 

It can happen that by substituting the matrix A for z in (8), we obtain 
a convergent matrix series 


f(A) = ag + ayA + +++ + an,A™ e+. (9) 


(A matrix series is called “convergent” if the numerical series consisting 
of the elements in the 7-th row and j-th column converge for arbitrary 
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i,j = 1,...,n.) In this case we say that the function f(z) is defined on 
the matrix A. 


Tueorem 27. We shall retain the notation of proposition (A). If all 
eigenvalues of the matrix A lie inside the circle of convergence of (8), 
1.€., 

ld] < p, a= 1,...,7, 


then the matrix series (9) converges, so that the matrix f(A) is defined. 
The numbers 
fy), t= 1, reg ly (10) 


which need not be distinct, comprise the set of all eigenvalues of the 
matrix f(A). In addition, if the eigenvalues \; of the matrix A lie inside 
the circle of convergence of the series defining a certain function g(z), 
so that the matrix g(A) is defined, then, in order that the matrices f(A) 
and g(A) coincide, it is necessary and sufficient that the functions f(z) 
and g(z) coincide on the spectrum of the matrix A. 


Proof. Let us form the partial sum 
Sm(2) = A + a2 + +++ + One 
of the series (8); then for |z| < p we have 
f@) = lim far). 


In addition, let ¢»(z) be a polynomial of a degree not exceeding k — 1 
which coincides with the polynomial f,,.(z) on the spectrum of the matrix 
A [see (A)]. Since the eigenvalues (2) of A satisfy the condition|\;| < p, 
t= 1,...,7, we have 


lim eQ) = fA), g=0,...,%—-1, t=1,...,7. 


From this it follows by proposition (A) that the sequence of polynomials 
¢m(z2) converges coefficient-wise to some polynomial ¢g(z) of degree 
<k — 1, where the polynomial ¢(z) and the function f(z) coincide on the 
spectrum of A. Since the polynomials ¢,,(z) and fin(z) coincide on the spec- 
trum of A, we have 

Jm(A) = em(A). 


As m — oo the right-hand side tends to ¢(A), and this means that the 
left-hand side also converges as m — oo. Thus the series (9) converges to 
the matrix f(A) = ¢(A). 
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We shall now prove that the polynomial 
T(z) = [2 — fd le — fre) - + fz — FO.) 
annihilates the matrix f(A). For this we consider the polynomial 


&,,(z) = [em(z) a m1) [Gm (2) _ Om(dz2)|* oes [em(2) —_ @m (Ar) |*? 
(11) 


and show that it annihilates A. The polynomial gn(z) — ¢n(d;) vanishes 
at z = 23, so that it is divisible by the expression z — ;. Thus, the poly- 
nomial (11) can be written in the form 


Pn(z) = VYm(z) A(z), 
so that the polynomial ©,,(z) annihilates A: 
[em(A) — gm(1)E}[¥m(A) — gm(d2) El... [em(A) — gm(Xe)E}*r = 0. 
Passing to the limit in this relation as m — ow, we obtain 
(f(A) — fOETLA(A) — S02) ET ++ f(A) — fO)E = 0, 


which means that the polynomial ['(z) annihilates the matrix f(A). 

Tn particular, it follows from what has been proved that all eigenvalues 
of the matrix f(A) are contained among the numbers (10) [see §32, (F)]. 
We shall prove that every number (10) is an eigenvalue of f(A). Let 
h; be the eigenvector of A corresponding to the eigenvalue };, so that 


Ah; = d, hy. 


From this it follows, by formula (6) of §32, that f,,.(A)hi = fnQhi. 
Passing to the limit in this relation as m — oo, we obtain 


S(A)be = fOdhi. 


Thus the number f(A,) is an eigenvalue of the matrix f(A). 

Let us now assume that the circle of convergence of the function g(z) 
also contains all the’eigenvalues of A. It then follows from what we have 
proved that the matrix g(A) is defined and that there exists a polynomial 
¥(z) of degree not exceeding k — 1 which coincides with the function g(z) 
on the spectrum of A, where ¥(A) = g(A). Now if f(A) = g(A), then 
e(A) = ¥(A), and by proposition (A) the polynomials ¢(z) and y(z) 
coincide on the spectrum of A, so that f(z) and g(z) must also coincide 
on the spectrum of A. Conversely, if the functions f(z) and g(z) coincide 
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on the spectrum of A, then the polynomials ¢({z) and y(z) also coincide on 
this spectrum, so that, by proposition (A), g(A) = y(A), which implies 
f(A) = g(A). Thus Theorem 27 is proved. 

Implicit functions of matrices. Let F(z, w) be a function of two complex 
variables defined by the series 


F(z,w) = a+ bz + cw + dz? + eew+ fw? +--+. (12) 


When the order of the factors in the terms of this series changes (for ex- 
ample, when the product z*w* is replaced by wz*), the function F(z, w) 
does not change. Therefore, if we replace the arguments z and w by matrices 
A and B in (12), it is natural to confine ourselves to the case when the 
matrices A and B commute. If (12) converges for arbitrary values of the 
variables z and w, then it can be proved that by substituting for z and w 
in this series any matrices A and B which commute, we obtain a con- 
vergent matrix series which will define a certain matrix which we shall 
denote by F(A, B). However, we shall not prove the convergence of this 
series in the general case, since we shall consider below only particular 
cases in which there is a finite number of terms depending on z, so that in 
fact the question is really that of the convergence of a series of the one 
complex variable w. 

(C) Let F(z, w) be an analytic function of two variables defined by (12), 
which is convergent for all values z, w, and let A be a given matrix. Further, 
suppose that we have a correspondence between every eigenvalue \; of 
A and a number yu; which satisfies the conditions 


FQ; ms) =0, SFOsu) #0, t= 1.0% (3) 


Then there exists a matrix B, which commutes with A, satisfying the con- 
dition 
F(A, B) = 0. (14) 


Furthermore, if the coefficients of (12) and the matrix A are real and if for 
every two complex conjugate eigenvalues \; and 4; = 4; of the matrix A 
the corresponding numbers yu; and yu; are also complex conjugate (u; = &;), 
then there exists a real matrix B, which commutes with A and which 
satisfies condition (14). 

We shall prove proposition (C). It follows from (13) and the implicit 
function theorem in the complex case that for any 1 = 1,...,r there 
exists a function W(z) = W,(z) which is defined for z close to \; and which 
satisfies the conditions 

F(z, W(@)) = 0, (15) 


Wa) = i; t= 1, see gy (16) 
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To determine the derivatives W“(A,) of W(z) at the point z = rj, it is 
necessary to differentiate successively relation (15) with respect to z and 
then to substitute in it z = Aj: 


@ 


a F(z, W(z))|z-a,; = 0. (17) 


For these relations, the numbers 
WW?) g=1,...,4—-1, t=1,...,7, (18) 


can be determined successively. Starting from (16) and (18), we shall 
construct the polynomial ¢(z) which satisfies conditions (4). We shall 
show that the matrix B = ¢(A), which obviously commutes with A, 
satisfies (14). 

For the proof we shall substitute in (12) the value w = ¢(z). We then 
obtain the function B(z) = F[z, e(z)] of z. To prove (14) it is sufficient 
to show that (z) is equal to zero on the spectrum of the matrix A (see 
Theorem 27). By calculating the derivatives 6 (;) of &(z) at the point 
ui, J = 0, 1,...,%; — 1, we can substitute the function W(z) for the 
polynomial ¢(z), since the respective derivatives of these functions of 
orders 0, 1,...,4; — 1, at the point \; are equal. But by replacing the 
polynomial ¢(z) in F(z, ¢(z)) by the function W(z) defined in the neighbor- 
hood of \;, we obtain an identity [see (15)]. Thus the function ®(z) vanishes 
on the spectrum of A. 

We shall now prove that if the coefficients of (12) and the matrix A 
are real and the numbers y; satisfy the requirement of conjugacy, ie., if 


Wi) = WO), i=1,...,r, 


then the polynomial ¢(z), as well as the matrix B = ¢(A), is real. Indeed, 
under these hypotheses the numbers W(d;), coming from (17), satisfy 
(5), so that the polynomial ¢(z) is real [see (B)]. Thus, proposition (C) 
is proved. 

(D) The analytic function e’ of z is defined by the series 


m! 


2 m 
P= ltet yt tate (19) 


which converges for all values of z. As is known, the identity e’t” = e? - e”, 


which follows from the properties of (19), is valid for two arbitrary com- 
plex numbers z and w. Hence it follows that the identity 


eAtB — ¢A. 9B (20) 


is valid for two commuting square matrices A and B. Thus, for any non- 
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singular matrix A there exists a matrix B, commuting with A, which 
satisfies the condition 
eF = A. (21) 


In addition, we find that for any real nonsingular matrix A there exists a 
real matrix B,, which commutes with A and satisfies the condition 


ei = A, (22) 


To prove that (21) can be solved for B it is sufficient to apply proposition 
(C) to the function F(z, w) = e” — z. Indeed, since the matrix A is 
nonsingular, all its eigenvalues ); are distinct from zero, so that there exist 
numbers yw; which satisfy the condition e*¢ — \; = 0 [see the first of the 
relations (13)]; the second of the relations (13) are obviously satisfied here. 

To prove the existence of a real matrix B, satisfying condition (22), it 
is sufficient to apply the second part of proposition (C) to the function 
F(z,w) = e” — 2”. Indeed, if ; is a real positive or negative number, 
then we set 4; = In Aj, where we take the real branch of the logarithm. 
If, however, \; is a complex number, then we can take complex conjugate 
numbers for W(A,) and W(k,). Thus, proposition (D) is proved. 


34. The Jordan form of a matrix. (A) The sequence of vectors 
hi,...,bn (1) 


of the space RF is called a basis set or a series with eigenvalue ) for the trans- 
formation A if the relations 


Ah, = Ahy, Ah» = Aho + h,, eney Ah,, =. Ahn + hn_1, h, x 0, 


are fulfilled. If the matrix A of the transformation A is real, then the 
sequence 


h,,..., Bn (2) 


obviously forms a series with eigenvalue X. The series (1) and (2) will 
be called complex conjugate. If the number \ and the vectors (1) are real, 
then the series is considered real. 


THEOREM 28. There exists a basis of the space FP consisting of all the 
vectors of one or more series for the transformation A. If the matrix A 
is real, then the series constituting the basis can be chosen in such a 
way that a series with real eigenvalues is real and series with complex 
eigenvalues are pairwise conjugate. 


Proof. Let 
A(z) = (2 — dy)". .. (2 — dy) (3) 
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be the minimal polynomial annihilating the matrix A, where 
ee 


are distinct eigenvalues of A. By proposition (G) of §32, the space R can 
be decomposed into a direct sum of the subspaces S,, ... , S, corresponding 
to the factors (3), so that the space S; consists of all vectors x which satisfy 
the condition (A — d,E)*ix = 0. This means that the annihilating poly- 
nomial of the transformation A taken on the space S; is the polynomial 
(z — d,)*i. It is easy to sec that this polynomial is minimal. 

Let us assume that A is real. I‘irst we combine all the factors of (3) 
with real \; into the factor A,(z) and all the other factors into the factor 
Ao(z). Then A(z) = A,(z) A(z) is a factorization into real relatively prime 
factors, and the corresponding decomposition of the space F into a direct 
sum of 2; and R, may be considered real. The space R, is now decom- 
posed into a sum of real summands corresponding to real eigenvalues ),;, 
and in these real direct summands we shall later construct bases consisting 
of real series. The space R, will be decomposed into pairwise complex 
conjugate direct summands corresponding to the complex conjugate eigen- 
values, and in these complex conjugate spaces we shall later construct 
bases consisting of complex conjugate series; here it is sufficient to con- 
struct the basis from a series in one of the two complex conjugate spaces, 
and in the other space to take the complex conjugate basis. 

Thus it is sufficient for us to prove that if the linear transformation A 
operating in the vector space S has the minimal annihilating polynomial 
(z — d)*, then in this space we can choose a basis consisting of series for 
the transformation A, and, moreover, of real series if the space S, the 
matrix A, and the eigenvalue } are all real. 

We shall go on to the proof of this statement. Jor brevity we assume 
that C = A — XE, and we denote by 7; the set of all vectors x of S which 
satisfy the condition 

Cx = 0. 
Then we have 
S=ToT'5d---sT' DST =0. 
Let 
hi,...,h, i=1,...,k, 
be a system of vectors from T; which are linearly independent with respect 


to the space 7;_,; this means that the vector 


ayh} +--+ + a,hj 
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can belong to space T;_, only if 
aj=...=a,= 0. 
We shall show that for fixed 7 and j the vectors 
hi_;=Ch, j <i, (4) 


belong to the space T;_; and are linearly independent with respect to the 
space T;_;_1. We have 


Chi; = Chi =0, a=1,...,%, 


and consequently the vectors (4) belong to the space T;_;. Let us assume 
now that the vector 
ayhj_j;+---+a,hi_;=x 


belongs to the space T;_;_1. Then we have 
0 = choy = C’—! (ayh} + ote + a,hj), 


and this means that the vector ayh; + --- + a,h} belongs to the space 
T;_, so that the numbers aj, ... , a, are zero. 
Let us select a maximal system of vectors 


hi,..., he (5) 


of 7; which are linearly independent with respect to T,_,. According to 
what has been proved, the vectors 


by_1 = Chyj, a=1,...,7., (6) 
belong to T,_, and are linearly independent with respect to T,,_2; thus 


system (6) can be augmented to a maximal system 


T — 
hi ,..., bey, Th-1 = Thy (7) 


of vectors of T,_1; which are linearly independent with respect to T;,_.. 
Continuing this process, we shall construct in the space 7;(2 > 0) a 
maximal system of vectors 


hi,...,h%, (8) 
which are linearly independent with respect to T;_1; here the relations 


hj = Chii1, a= 1, eee Titty, TH = Titd, 


will be fulfilled. 
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We shall now prove that the set >°; of all vectors which belong to all 
systems (8),2 = j,j — 1,..., 1, constitutes a basis of the space T;. The 
proof will be carried out by induction on the number 7. For j = 1 the 
system >>; coincides with the system (8) and therefore is a basis of the 
space T,, To = 0. Let us assume that our assertion has been proved for 
the system >°;, and we shall prove it for the system }°;41. Let us assume 
that the relation 


arb}y1 +--+ ta, heh + bib} +--+ b,H¥+---= 0 (9) 
holds. Applying the transformation B’ to (9) we obtain 


1 "i+ 
ayhi +--+ + 4,,, hi" = 0, 


and this is possible only if a] = --- = a,,,, = 0; thus relation (9) 
can contain only vectors of the system }>°;, so that by the induction 
hypothesis, relation (9) is trivial. Now let x be an arbitrary vector of the 
space T;4,. Since the system (8) forz = j + 1 forms a maximal linearly 
independent system for the space T,, there exists a vector 


1 "j4l 
y= abjyi +--+ +4, beh 


such that x — y belongs to 7; and, by the induction hypothesis, may be 
expressed linearly in terms of the vectors of the system >°,;, so that the 
vector x may be expressed linearly in terms of the vectors of 3°j41. 

Thus we have shown that >-; is a basis of the space S = Tx. 

If the space S, the matrix A, and the number ) are real, then by taking 
the vectors of (5) as real, we obtain a real system (6), which can be aug- 
mented to the real system (7). Continuing in this way, we obtain the real 


system or. 
We shall now show that the system >°, consists of series, i.e., we shall 
show that the vectors hf, hg, . . . form a series with eigenvalue X. We have 


0 = Chi = (A — dE)hi, 
so that Ahf = Ahf; further, 
hi = Ch = (A — XE)hz, 


so that Ah = Ahg + hf, and so on. Thus Theorem 28 is proved. 

In the basis constructed by means of Theorem 28, there corresponds to 
the transformation A not the original matrix A = (a), but some new 
matrix B = (b}) which has a particularly simple form, called the Jordan 
form. Thus Theorem 28 is a theorem on the reduction of a matrix to Jordan 
form. Let us examine this question in detail. 


34] THE JORDAN FORM OF A MATRIX 295 


(B) By a Jordan block of the mth-order with eigenvalue \ we shall 
mean a square matrix (g;) of mth-order which is defined by the relations 


gi =r, a= 1,...,m; gina = I, t=1,...,m—1; 
g=0, gG-i<0 and j-—i>]; 


ie., the matrix 


»10--- 00 
0x1 0 
002r--- 00 
000-1 
000-:- 0» 


It turns out that for every nth-order square matrix A, a nonsingular square 
matrix S may be chosen such that the matrix B = SAS! which is ob- 
tained from the matrix A by a transformation of the matrix S is of Jordan 
form, i.e., consists of one or more Jordan blocks located along its principal 
diagonal, while all the elements not contained in the Jordan blocks are 
equal to zero. 

Let us prove this. Let be an n-dimensional vector space and A a 
linear transformation corresponding (in a certain coordinate system) to 
the matrix A. Now let f,,...,f, be a basis of the space F consisting of 
series (see Theorem 28). We shall assume that the vectors f;,...,f, 
are arranged in such an order that the vectors of each series follow one 
another in the sequence f,,...,fn,. We shall denote by B = (bi) the 
matrix of the transformation A in the basis f,,...,f,. Let 


h, = f,, sey hy, = fn 


be the first series appearing in the sequence f,,...,f,, and let > be the 
corresponding eigenvalue. Then, as follows directly from the definition 
of a series, we have 


bi = 2, ta=1,...,m; bia, = 1, a=1,...,m—1; 
b=0, it1<j<m and i>j<m. 


Thus the first Jordan block in the matrix B corresponds to the first series 
of the sequence f,,...,f,. In exactly the same way the second Jordan 
block in the matrix B corresponds to the second series of the basisf;,..., fz, 
and so on. Since the matrix A can be transformed into B [see §32, (A)], 
we have B = SAS!, which proves proposition (B). 
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condenser, 81 
conjugate vectors, 47 
continuity of solutions, 170 ff 

in the initial values, 178 ff, 198 ff 
continuous dependence on parameters, 

170 ff, 193 ff 

contraction mappings, 151 
current, 81 
current source, 83 
cycle, 106 


damping, 89 

degenerate node, 121, 123 

differentiability of solutions, 170 ff, 
197 ff, 223 

differentiation of a determinant, 131 

differentiation operator, 43 

differentiation with respect to a 
system, 204 

domain of definition, 2, 19 


eigenvalue, 95, 277 

eigenvector, 95, 277 

electrical circuits, 80 ff 

electron tube, 84 

elementary divisors, 102 

elimination method, 67 

equilibrium state, 106, 201 ff, 231, 
244 ff 

equivalent equations, 145 

Euler’s formulas, 36 

existence theorems, 3, 18, 36, 150 ff 


filter, 93 

first integral, 181 

first-order equations, 1] 

focus, 119, 257 

forced oscillations, 79 

free term, 22, 39, 127 

frequency, 31, 76 

fundamental matrix, 130, 144 

fundamental system of solutions, 44, 
49, 51, 98, 127, 129, 139 

fundamental theorem of algebra, 2-3 


global theorems, 150, 192 ff 


Hadamard’s lemma, 172 

half-life, 6 

Hamiltonian system, 188 
harmonic oscillator, 30, 76, 79 
homogeneous equation, 11, 39, 42 
homogeneous function, 10 
Hurwitz’ theorem, 58 


impedance, 86 
indefinite integral, 7 


296 


INDEX 


inductance, 82 

inductor, 81 

initial conditions, 3 

initial values, 3, 20 

continuity and differentiability in, 

178 ff, 198 ff, 223 

integral curve, 4, 23 

integral equation, 151 

integral theorems, 150, 192 ff 

interval of definition; see domain of 
definition 

invariants, complete system of, 146 

isolated solution, 221 

iteration, 275 


Jordan form, 95, 97, 291 ff 


Kirchhoff’s laws, 84, 238 
Kronecker symbol, 68 


lagging argument, 105 
Lagrange’s formula, 161 
length (of a series or basis set), 102 
limit cycle, 220 ff 
criterion for existence of, 227 
linear dependence, 47, 128, 138 
linear differential equations, 39 
with constant coefficients, 41 ff 
with variable coefficients, 127 ff 
linearization, 209, 240 
Liouville’s formula, 131, 143, 273 
Lipschitz condition, 19, 155-156, 161 
load resistance, 93 
local theorems, 150, 170 
loop currents, 85 
Lyapunov function, 204, 241, 265 
Lyapunov stable, 202, 213, 261, 272 
Lyapunov’s theorem, 146, 201 ff, 264 


matrix form of linear equations, 134 

matrix function, 284 

maximum interval of existence, 20, 
106, 127, 189 

minimal annihilating polynomial, 277 ff 

multiple roots, 50 

multiplicity of a trajectory, 275 

mutual induction, 83, 238 


natural frequency, 90 
natural oscillation, 80 
nodal voltages, 86 
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node, 117 

nonautonomous system, 188 

nonhomogeneous equation, 11, 62 

nonuniformity of performance, 219 

nonuniqueness, 17 

norm, 152, 162 

normal system, 19, 22, 25, 94, 159, 
190, 201, 234 

normalizable system, 74 


w-limit point, 227 
w-limit set, 227 

Ohm’s law, 81 

open circuit, 88 
operational notation, 41 
order of a system, 26, 27 
oscillatory loop, 87 


parameters 
dependence on, 170 ff, 178 ff, 193 ff 
differentiability in terms of, 170 ff, 

197 ff 

partial differential equation, 186 

passive element, 83 

pendulum, 31 

period, 107 

periodic coefficients, 144 

periodic solution, 106, 221, 229, 231, 
261 ff 

phase, 30 

phase plane, 115, 221 

phase space, 103 ff, 108, 191 

phase trajectory, 109, 116 

phase velocity, 109 

Picard’s approximation theorem, 3, 
150 ff 

Poincaré, 220 

positive definite, 205 

principal minor, 60 


quadratic form, 205 
quadratures, 6 
quasipolynomial, 62 


radioactive decay, 5 

reduction to normal system, 25 
reduction of order, 136, 142 
resistance, 81 

resistor, 81 

resonance, 79, 91 

rough cycle, 226, 235 
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saddle, 118, 244, 246 

semistable limit cycle, 221, 235 

semistable state, 112 

separation of variables, 9, 11, 24 

series, 291 

shift formula, 51 

singular solutions, 30 

solution of a differential equation, 2 

spectrum of a matrix, 284 

spiral, 120, 232 

stability, 41, 200 ff, 219, 261 ff 

stable focus, 120 

stable limit cycle, 221 

stable node, 117, 244, 254 

stable polynomial, 57 

stable solution, 78, 111 

steady-state process, 78, 94 

steam engine, 213 ff 

succession function, 223, 224, 226, 
234, 236 

succession mapping, 274 

successive approximations, 151, 158 

successor, 223 

symbolic notation, 42 

system of equations, 26 


three-terminal element, 236 ff 

torus, 114 

trajectory, 105, 189, 221, 245, 253 ff 
transformation of variables, 99 
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transformer, 91 
transformer ratio, 93 
transient process, 78 
triangular matrix, 65 
triode, 236 
characteristic of, 236 
two-termina]l element, 81 


undetermined coefficients, 70, 102 
uniform convergence, 152, 162 
uniqueness theorems, 18 

unstable focus, 120, 244 

unstable node, 118, 244 


vacuum-tube oscillator, 236 ff 
Vandermonde’s determinant, 45 
variation of parameters, 12, 133, 141 
variational equations, 174, 180 
vector, 46, 160 
voltage drop, 81 
voltage source, 83 
Vyshnegradskiy, 213 ff 

theses of, 220 


Watt, 213 
Witt, 264 
Wronskian, 130, 132, 140 


w-limit point, 227 
w-limit set, 227 


